[V1] Support per-request seed #9945

njhill · 2024-11-02T01:37:07Z

And make batch generators sparse

Signed-off-by: Nick Hill <[email protected]>

vllm/v1/worker/gpu_model_runner.py

njhill · 2024-11-02T01:42:13Z

@@ -645,8 +650,7 @@ def make_sampling_metadata(
            top_k=self.top_k[:self.num_reqs],
            no_top_p=self.no_top_p,
            no_top_k=self.no_top_k,
-            generators=self.generators[:self.num_reqs],
-            no_generator=self.no_generator,
+            generators=self.generators,
            max_num_logprobs=self.max_num_logprobs,
        )
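To illustrate what "sparse" generators means in the hunk above, here is a minimal sketch, not the PR's actual code. It assumes the generators are held in a dict keyed by request index, with entries only for requests that supplied a seed, so no per-batch slice or `no_generator` flag is needed. The standard-library `random.Random` stands in for a real generator object, and `make_generators` and `sample_batch` are hypothetical names.

```python
import random
from typing import Dict, List, Optional


def make_generators(seeds: List[Optional[int]]) -> Dict[int, random.Random]:
    """Build a sparse map: only requests that supplied a seed get an entry.

    Hypothetical helper; `random.Random` is a stand-in for a real generator.
    """
    return {i: random.Random(s) for i, s in enumerate(seeds) if s is not None}


def sample_batch(
    probs: List[List[float]],
    generators: Dict[int, random.Random],
    default_rng: random.Random,
) -> List[int]:
    """Sample one token index per row.

    Rows with an entry in `generators` use their own seeded RNG;
    all other rows share `default_rng`.
    """
    sampled = []
    for i, row in enumerate(probs):
        rng = generators.get(i, default_rng)
        sampled.append(rng.choices(range(len(row)), weights=row, k=1)[0])
    return sampled


# Only request 1 supplies a seed, so the map has a single entry.
seeds = [None, 42, None]
probs = [[0.25, 0.25, 0.25, 0.25] for _ in seeds]
first = sample_batch(probs, make_generators(seeds), random.Random())
second = sample_batch(probs, make_generators(seeds), random.Random())
# The seeded request is reproducible across runs; unseeded ones need not be.
assert first[1] == second[1]
```

With this layout an empty dict means that no request in the batch is seeded, which is the information a separate boolean flag would otherwise carry.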


@WoosukKwon can we avoid re-creating the SamplingMetadata object here when skip_copy is True? Just keep it as a field in the batch ...

Yeah, that could be a great idea, though I think the current code doesn't hurt performance, since only one SamplingMetadata object is created for the entire batch.
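The suggestion in this thread could look roughly like the sketch below. It is an illustration only and assumes that `skip_copy=True` means the batch has not changed since the previous step. `BatchSketch`, `SamplingMetadataSketch`, and the `num_builds` counter are hypothetical names, not classes from the code under review.

```python
import random
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class SamplingMetadataSketch:
    """Hypothetical stand-in holding two of the fields shown in the diff."""
    generators: Dict[int, random.Random]
    max_num_logprobs: int


class BatchSketch:
    """Hypothetical batch object that keeps its last metadata as a field."""

    def __init__(self) -> None:
        self.generators: Dict[int, random.Random] = {}
        self.max_num_logprobs = 0
        self._cached: Optional[SamplingMetadataSketch] = None
        self.num_builds = 0  # counts how often the object is rebuilt

    def make_sampling_metadata(self, skip_copy: bool = False) -> SamplingMetadataSketch:
        # Assumption: skip_copy=True signals an unchanged batch,
        # so the previously built object can be returned as-is.
        if skip_copy and self._cached is not None:
            return self._cached
        self.num_builds += 1
        self._cached = SamplingMetadataSketch(
            generators=self.generators,
            max_num_logprobs=self.max_num_logprobs,
        )
        return self._cached


batch = BatchSketch()
first = batch.make_sampling_metadata()
again = batch.make_sampling_metadata(skip_copy=True)
assert again is first  # reused, not rebuilt
```

As the reply notes, only one such object is built per batch per step, so any saving from caching it is likely small.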

WoosukKwon

LGTM! Thanks for the PR!

[V1] Support per-request seed

605c344

And make batch generators sparse

Signed-off-by: Nick Hill <[email protected]>

njhill requested a review from WoosukKwon November 2, 2024 01:37

njhill commented Nov 2, 2024

vllm-project deleted a comment from github-actions bot Nov 2, 2024

WoosukKwon approved these changes Nov 3, 2024

WoosukKwon merged commit 1f1b6d6 into vllm-project:main Nov 3, 2024
32 checks passed

njhill deleted the v1-seed branch November 4, 2024 17:54

lk-chen pushed a commit to lk-chen/vllm that referenced this pull request Nov 4, 2024

[V1] Support per-request seed (vllm-project#9945)

8767bd5

Signed-off-by: Nick Hill <[email protected]>

lk-chen pushed a commit to lk-chen/vllm that referenced this pull request Nov 4, 2024

[V1] Support per-request seed (vllm-project#9945)

0565fe9

Signed-off-by: Nick Hill <[email protected]> Signed-off-by: Linkun Chen <[email protected]>

richardsliu pushed a commit to richardsliu/vllm that referenced this pull request Nov 4, 2024

[V1] Support per-request seed (vllm-project#9945)

63c4c09

Signed-off-by: Nick Hill <[email protected]> Signed-off-by: Richard Liu <[email protected]>

bigPYJ1151 pushed a commit to bigPYJ1151/vllm that referenced this pull request Nov 5, 2024

[V1] Support per-request seed (vllm-project#9945)

f559ba2

Signed-off-by: Nick Hill <[email protected]>

DarkLight1337 pushed a commit that referenced this pull request Nov 5, 2024

[V1] Support per-request seed (#9945)

06eaf6b

Signed-off-by: Nick Hill <[email protected]>

hissu-hyvarinen pushed a commit to ROCm/vllm that referenced this pull request Nov 6, 2024

[V1] Support per-request seed (vllm-project#9945)

66ec4c3

Signed-off-by: Nick Hill <[email protected]>

JC1DA pushed a commit to JC1DA/vllm that referenced this pull request Nov 11, 2024

[V1] Support per-request seed (vllm-project#9945)

4584484

Signed-off-by: Nick Hill <[email protected]> Signed-off-by: Loc Huynh <[email protected]>

sumitd2 pushed a commit to sumitd2/vllm that referenced this pull request Nov 14, 2024

[V1] Support per-request seed (vllm-project#9945)

094e726

Signed-off-by: Nick Hill <[email protected]> Signed-off-by: Sumit Dubey <[email protected]>

KuntaiDu pushed a commit to KuntaiDu/vllm that referenced this pull request Nov 20, 2024

[V1] Support per-request seed (vllm-project#9945)

3ca4ca4

Signed-off-by: Nick Hill <[email protected]>

mfournioux pushed a commit to mfournioux/vllm that referenced this pull request Nov 20, 2024

[V1] Support per-request seed (vllm-project#9945)

39db1b5

Signed-off-by: Nick Hill <[email protected]> Signed-off-by: Maxime Fournioux <[email protected]>
