[BugFix] Fix mixed penalties batch with async scheduling #27910
njhill merged 1 commit into vllm-project:main
Conversation
Code Review
This pull request addresses a bug in async scheduling where a batch containing a mix of requests with and without penalties could fail. The fix involves replacing placeholder -1 token IDs with a valid token ID to prevent errors in downstream operations. The approach is sound. I've suggested a minor improvement to make the fix more robust by using vocab_size as the replacement value, which is already used as a padding/ignore value, instead of 0.
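As a minimal sketch of the suggested improvement (my own illustration with made-up shapes and values, not vLLM code): rows belonging to requests without penalties carry `-1` placeholder token IDs, and replacing them with `vocab_size` keeps them in the out-of-vocabulary padding slot that the penalties path already ignores, whereas `0` is a real token ID that could be wrongly penalized.

```python
import torch

# Hypothetical values for illustration; vLLM's real tensors differ.
vocab_size = 8
output_tokens_t = torch.tensor(
    [[3, 5, -1, -1],   # row for a request without penalties: -1 placeholders
     [1, 2, 4, 6]]     # row for a request with penalties: real token ids
)
# In-place replacement, as in the fix: placeholders become vocab_size,
# an out-of-vocabulary index treated as padding downstream.
output_tokens_t.masked_fill_(output_tokens_t == -1, vocab_size)
```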
Signed-off-by: Nick Hill <nhill@redhat.com>
Force-pushed from 36e9424 to 1aecaef
…t#27910) Signed-off-by: Nick Hill <nhill@redhat.com>
# scatter done in apply_penalties is valid.
# NOTE(nick): The penalties implementation is currently quite inefficient and
# will be reworked anyhow.
output_tokens_t.masked_fill_(output_tokens_t == -1, vocab_size)
Hi, I’d like to ask why the actual draft token isn’t used here to replace the placeholder? Thanks.
@hidva this line should only apply to requests/rows in the batch which don't require the output tokens (i.e. those which don't use penalties sampling parameters). The other rows should not contain any placeholder tokens at this point.
Also async scheduling + spec decode + penalties isn't yet supported (any help with that appreciated though - see discussion in #30122).
#26467 fixed compatibility of penalties sampling parameters with async scheduling, but it has a flaw: it breaks when the batch contains a mix of requests with and without penalties, specifically when a request with a penalties param starts while a batch without penalties is already running.
This is a fix for that case.