Skip to content

[Bugfix] Truncate mimo-audio code2wav prompt to MAX_CODE2WAV_TOKENS#2693

Merged
lishunyang12 merged 5 commits into
vllm-project:mainfrom
lishunyang12:fix-2683-mimo-audio-code-len
Apr 20, 2026
Merged

[Bugfix] Truncate mimo-audio code2wav prompt to MAX_CODE2WAV_TOKENS#2693
lishunyang12 merged 5 commits into
vllm-project:mainfrom
lishunyang12:fix-2683-mimo-audio-code-len

Conversation

@lishunyang12

@lishunyang12 lishunyang12 commented Apr 11, 2026

Copy link
Copy Markdown
Collaborator

Purpose

Fixes #2683.

Stage-1 (code2wav) crashes with

ValueError: could not broadcast input array from shape (55656,) into shape (8192,)
  at vllm/v1/worker/gpu_input_batch.py:342 in add_request

whenever the online llm2code2wav input processor flattens a long stage-0 talker output and hands it to stage-1. Each talker frame expands to 36 ids after prepend_and_flatten_colmajor (pad row + col-major reshape of an (8, 4) codec block), so ~1546 frames already produce 55656 ids — well past mimo_audio.yaml's stage-1 max_model_len: 18192, and far past whatever lower cap the underlying code2wav model enforces.

The offline path (examples/offline_inference/mimo_audio/end2end.py) already guards against this exact failure with MAX_CODE2WAV_TOKENS = 18192 + a warn-and-truncate. The online stage input processor was missing the same safeguard, so any long-enough request killed the engine core.

Test Result

Fixes vllm-project#2683

Signed-off-by: lishunyang <lishunyang12@163.com>
@chatgpt-codex-connector

Copy link
Copy Markdown

Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits.
Credits must be used to enable repository wide code reviews.

@lishunyang12

Copy link
Copy Markdown
Collaborator Author

@qibaoyuan PTAL

@qibaoyuan

Copy link
Copy Markdown
Contributor

LGTM

@lishunyang12 lishunyang12 enabled auto-merge (squash) April 13, 2026 06:55
@Gaohan123 Gaohan123 added the ready label to trigger buildkite CI label Apr 14, 2026
@Gaohan123 Gaohan123 added this to the v0.20.0 milestone Apr 14, 2026

@Gaohan123 Gaohan123 left a comment

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@lishunyang12 lishunyang12 left a comment

Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ci failed.

@lishunyang12 lishunyang12 merged commit 618268d into vllm-project:main Apr 20, 2026
7 of 8 checks passed
lvliang-intel pushed a commit to lvliang-intel/vllm-omni that referenced this pull request Apr 21, 2026
nainiu258 pushed a commit to nainiu258/vllm-omni that referenced this pull request Apr 21, 2026
…llm-project#2693)

Signed-off-by: lishunyang <lishunyang12@163.com>
Signed-off-by: nainiu258 <cperfect02@163.com>
qinganrice pushed a commit to qinganrice/vllm-omni that referenced this pull request Apr 23, 2026
lengrongfu pushed a commit to lengrongfu/vllm-omni that referenced this pull request May 1, 2026
clodaghwalsh17 pushed a commit to clodaghwalsh17/nm-vllm-omni-ent that referenced this pull request May 12, 2026
daixinning pushed a commit to daixinning/vllm-omni that referenced this pull request May 28, 2026
quyifei23 pushed a commit to quyifei23/vllm-omni that referenced this pull request Jun 6, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ready label to trigger buildkite CI

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Bug]:online_serving mimo_audio bug

3 participants