[BugFix] Async scheduling: handle model forward errors more cleanly by njhill · Pull Request #31611 · vllm-project/vllm

njhill · 2026-01-02T00:40:44Z

If the model runner execute_model() method raises an exception, it will be logged via a future callback, but the core loop will subsequently fail with a misleading secondary exception since sample_tokens() will return None:

`AttributeError: 'NoneType' object has no attribute 'sampled_token_ids'`

This PR changes the step_with_batch_queue() method in the core loop to instead raise the root cause exception from the execute_model() future inline in this case, which also removes the need for the error callback.

If the model runner execute_model() raises an exception, it will be logged via a future callback, but the core loop will subsequently fail with a misleading exception since sample_tokens() will return None: `AttributeError: 'NoneType' object has no attribute 'sampled_token_ids'` This PR changes the step_with_batch_queue() method to instead raise the root cause exception from the execute_model() future inline in this case, which also removes the need for the error callback. Signed-off-by: njhill <nickhill123@gmail.com>

gemini-code-assist

Code Review

This pull request refactors the error handling in asynchronous scheduling to propagate the root cause exception from execute_model() failures, which is a good improvement. However, the implementation introduces a critical bug where a successful execution path can lead to a RuntimeError. My review includes a comment with a suggested fix for this issue.

vllm/v1/engine/core.py

Signed-off-by: Hochan Son <ohsono@gmail.com>

njhill · 2026-01-02T20:54:03Z

CI failure is unrelated

…llm-project#31611) Signed-off-by: njhill <nickhill123@gmail.com>

…llm-project#31611) Signed-off-by: njhill <nickhill123@gmail.com> Signed-off-by: dsuhinin <suhinin.dmitriy@gmail.com>

…llm-project#31611) Signed-off-by: njhill <nickhill123@gmail.com>

Signed-off-by: Hochan Son <ohsono@gmail.com>

mergify bot added the v1 label Jan 2, 2026

gemini-code-assist bot reviewed Jan 2, 2026

View reviewed changes

vllm/v1/engine/core.py Show resolved Hide resolved

njhill marked this pull request as ready for review January 2, 2026 00:49

njhill added the ready ONLY add when PR is ready to merge/full CI is needed label Jan 2, 2026

njhill mentioned this pull request Jan 2, 2026

Fix: Add None check in step_with_batch_queue for async scheduling #31600

Closed

njhill changed the title ~~[Core] Async scheduling: handle model forward errors more cleanly~~ [BugFix] Async scheduling: handle model forward errors more cleanly Jan 2, 2026

njhill mentioned this pull request Jan 2, 2026

[Bugfix] Add SM 12.1 support + Fix GPT-OSS Harmony garbled reasoning and HarmonyError crashes #31607

Open

4 tasks

ohsono added a commit to ohsono/vllm that referenced this pull request Jan 2, 2026

Remove V1 engine crash fix - will be handled by PR vllm-project#31611

3fae79f

Signed-off-by: Hochan Son <ohsono@gmail.com>

mgoin approved these changes Jan 4, 2026

View reviewed changes

vllm-bot merged commit b53b89f into vllm-project:main Jan 4, 2026
48 of 50 checks passed

njhill deleted the async-exec-errs branch January 4, 2026 19:35

LucasWilkinson pushed a commit to neuralmagic/vllm that referenced this pull request Jan 6, 2026

[BugFix] Async scheduling: handle model forward errors more cleanly (v…

08b3492

…llm-project#31611) Signed-off-by: njhill <nickhill123@gmail.com>

yugong333 pushed a commit to yugong333/vllm that referenced this pull request Jan 9, 2026

[BugFix] Async scheduling: handle model forward errors more cleanly (v…

cfc1e68

…llm-project#31611) Signed-off-by: njhill <nickhill123@gmail.com>

akh64bit pushed a commit to akh64bit/vllm that referenced this pull request Jan 16, 2026

[BugFix] Async scheduling: handle model forward errors more cleanly (v…

2aac40b

…llm-project#31611) Signed-off-by: njhill <nickhill123@gmail.com>

dsuhinin pushed a commit to dsuhinin/vllm that referenced this pull request Jan 21, 2026

[BugFix] Async scheduling: handle model forward errors more cleanly (v…

567c540

…llm-project#31611) Signed-off-by: njhill <nickhill123@gmail.com> Signed-off-by: dsuhinin <suhinin.dmitriy@gmail.com>

ItzDEXX pushed a commit to ItzDEXX/vllm that referenced this pull request Feb 19, 2026

[BugFix] Async scheduling: handle model forward errors more cleanly (v…

0c10f14

…llm-project#31611) Signed-off-by: njhill <nickhill123@gmail.com>

ohsono added a commit to ohsono/vllm that referenced this pull request Feb 25, 2026

Remove V1 engine crash fix - will be handled by PR vllm-project#31611

25ed9da

Signed-off-by: Hochan Son <ohsono@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[BugFix] Async scheduling: handle model forward errors more cleanly#31611

[BugFix] Async scheduling: handle model forward errors more cleanly#31611
vllm-bot merged 1 commit intovllm-project:mainfrom
njhill:async-exec-errs

njhill commented Jan 2, 2026 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

njhill commented Jan 2, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

njhill commented Jan 2, 2026 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

njhill commented Jan 2, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

njhill commented Jan 2, 2026 •

edited by github-actions bot

Loading