
[BugFix] Fix TypeError crash during dummy_run in OmniGPUModelRunner#2831

Merged
tzhouam merged 1 commit into vllm-project:dev/migrate-MR-v2 from Sy0307:fix/v2-dummy-run-type-error
Apr 16, 2026

Conversation

Contributor

@Sy0307 Sy0307 commented Apr 15, 2026

Purpose

Fix TypeError: Unexpected model output type: <class 'torch.Tensor'> crash during profile_run / dummy_run in V2 Omni model runner.

PR #2819 introduced a raise TypeError in the else branch of the hidden-state extraction logic. However, when dummy_run=True (as called by profile_run / determine_available_memory), the model forward returns a bare torch.Tensor, not an OmniOutput or a 2-tuple. The raise therefore kills engine initialization for all Omni models on V2.

Replaces the raise TypeError with a passthrough hidden_states = model_output, which correctly handles bare tensors from dummy runs.
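The change can be sketched as follows. This is a minimal reconstruction, not the actual runner code: OmniOutput here is a stand-in dataclass, and the helper name extract_hidden_states and the exact tuple layout are assumptions.

```python
from dataclasses import dataclass
from typing import Any


@dataclass
class OmniOutput:  # hypothetical stand-in for the real OmniOutput type
    hidden_states: Any


def extract_hidden_states(model_output: Any) -> Any:
    """Mirror of the fixed extraction logic: handle OmniOutput, 2-tuples,
    and the bare torch.Tensor returned during dummy_run/profile_run."""
    if isinstance(model_output, OmniOutput):
        return model_output.hidden_states
    if isinstance(model_output, tuple) and len(model_output) == 2:
        return model_output[0]
    # Previously this branch raised
    #   TypeError(f"Unexpected model output type: {type(model_output)}")
    # which crashed profile_run. The fix passes anything else (e.g. a bare
    # torch.Tensor from a dummy run) through unchanged.
    return model_output
```

The catch-all return is the one-line fix: bare tensors from dummy runs are treated as the hidden states themselves.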

Test Plan

  • CI passes on H100 (Omni V2 tests: Qwen3-Omni, Qwen2.5-Omni, Qwen3-TTS)
  • Verify profile_run / determine_available_memory completes without crash

Test Result

Pending CI

Model forward returns a bare torch.Tensor during dummy_run/profile_run,
which is neither OmniOutput nor a 2-tuple. The raise TypeError in the
else branch kills profile_run → determine_available_memory → engine
initialization. Replace with passthrough assignment.

Signed-off-by: Sy03 <1370724210@qq.com>
@Sy0307 Sy0307 marked this pull request as ready for review April 15, 2026 18:09
@Sy0307 Sy0307 requested a review from hsliuustc0106 as a code owner April 15, 2026 18:09
@chatgpt-codex-connector

Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits.
Credits must be used to enable repository-wide code reviews.

@hsliuustc0106
Collaborator

Fix is correct for the immediate issue — dummy_run returns bare torch.Tensor and the previous raise was blocking V2 initialization.

Consider making the passthrough more explicit rather than a catch-all else: an isinstance check on the bare-tensor case would avoid silently accepting unexpected types.
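The stricter variant being suggested might look like this sketch. The function name and surrounding shapes are assumptions; the try/except lets the sketch run even without torch installed, whereas the real runner imports torch unconditionally.

```python
from dataclasses import dataclass
from typing import Any

try:
    from torch import Tensor  # the real dependency in the runner
except ImportError:
    class Tensor:  # minimal stand-in so the sketch runs without torch
        pass


@dataclass
class OmniOutput:  # hypothetical stand-in for the real OmniOutput type
    hidden_states: Any


def extract_hidden_states_strict(model_output: Any) -> Any:
    """Accept only the three known output shapes; still raise on anything
    genuinely unexpected instead of silently passing it through."""
    if isinstance(model_output, OmniOutput):
        return model_output.hidden_states
    if isinstance(model_output, tuple) and len(model_output) == 2:
        return model_output[0]
    if isinstance(model_output, Tensor):
        # Bare tensor from dummy_run/profile_run: pass through explicitly.
        return model_output
    raise TypeError(f"Unexpected model output type: {type(model_output)!r}")
```

This keeps the TypeError as a guard for truly unknown types while whitelisting the dummy-run tensor case.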

Also missing a regression test that verifies profile_run / dummy_run completes without raising TypeError on V2 Omni models.
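A regression test along those lines might be sketched as below. Everything here is a stand-in: the duck-typed extract_hidden_states helper is hypothetical, and a real test would exercise OmniGPUModelRunner's dummy_run on an actual V2 Omni model.

```python
from typing import Any


def extract_hidden_states(model_output: Any) -> Any:
    """Passthrough extraction as merged in this PR (duck-typed stand-in)."""
    if hasattr(model_output, "hidden_states"):  # OmniOutput-like
        return model_output.hidden_states
    if isinstance(model_output, tuple) and len(model_output) == 2:
        return model_output[0]
    return model_output  # bare tensor from dummy_run/profile_run


def test_profile_run_bare_tensor_passthrough() -> None:
    # Regression: a bare tensor-like output from dummy_run must not raise
    # TypeError and must be returned unchanged as the hidden states.
    bare = object()  # stands in for a torch.Tensor
    assert extract_hidden_states(bare) is bare


test_profile_run_bare_tensor_passthrough()
```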

@tzhouam tzhouam merged commit 3281a6f into vllm-project:dev/migrate-MR-v2 Apr 16, 2026
2 checks passed