[BUG FIX]: prevent EngineCore crash when Qwen TTS Base task is missing ref_text by teith · Pull Request #2203 · vllm-project/vllm-omni

teith · 2026-03-26T00:39:18Z

A single malformed request to /v1/audio/speech with task_type=Base and no ref_text kills the entire EngineCore. All subsequent requests — including valid ones — fail with EngineDeadError.

Reproduce:
serve:
vllm-omni serve Qwen/Qwen3-TTS-12Hz-1.7B-Base --omni --trust-remote-code --host 0.0.0.0 --port 8000 --enforce-eager

from openai import OpenAI
client = OpenAI(api_key="none", base_url="http://localhost:8000/v1")
client.audio.speech.create(
    model="Qwen/Qwen3-TTS-12Hz-1.7B-Base",
    voice="clone",
    input="Hello",
    extra_body={
        "task_type": "Base",
        "ref_audio": "https://cdn-media.huggingface.co/speech_samples/sample1.flac",
    },
)
# Engine is now dead. Every subsequent request returns EngineDeadError.

Root cause: _build_prompt_embeds() raises ValueError when ref_text is missing in ICL mode. vLLM v1 treats any worker exception as fatal.

Fix:
serving_speech.py: validate ref_text presence for Base task before the request reaches the engine (returns HTTP 400)
qwen3_tts_talker.py: fall back to x-vector-only mode instead of raising, matching the existing non-ICL codepath (protects offline inference and other entrypoints)

yenuo26 · 2026-03-26T09:15:46Z

Do we need to add test cases for this scenario? @linyueqian

linyueqian · 2026-03-26T13:24:21Z

Do we need to add test cases for this scenario? @linyueqian

yes, it would be great.

linyueqian

LGTM

linyueqian · 2026-03-26T13:25:17Z

fix pre-commit please

linyueqian · 2026-03-27T01:24:36Z

fix dco please

lishunyang12

left a minor comment, otherwise looks good

hsliuustc0106 · 2026-04-02T23:56:38Z

fix ci

Parametrized test covering ref_text=None, "", and whitespace-only to ensure the serving layer returns HTTP 400 instead of letting a malformed request crash the EngineCore. Regression test for vllm-project#2203 Signed-off-by: Yueqian Lin <yueqian.lin@outlook.com>

… TTS Base task A single malformed request to /v1/audio/speech with task_type=Base and no ref_text kills the entire EngineCore. All subsequent requests fail with EngineDeadError. Root cause: _build_prompt_embeds() raises ValueError when ref_text is missing in ICL mode. vLLM v1 treats any worker exception as fatal. Fix: - serving_speech.py: validate ref_text presence for Base task before the request reaches the engine (returns HTTP 400) - qwen3_tts_talker.py: fall back to x-vector-only mode instead of raising, matching the existing non-ICL codepath Also adds a parametrized regression test covering ref_text=None, "", and whitespace-only inputs. Signed-off-by: Yueqian Lin <70319226+linyueqian@users.noreply.github.com>

…g ref_text (vllm-project#2203) Signed-off-by: Yueqian Lin <70319226+linyueqian@users.noreply.github.com> Co-authored-by: Yueqian Lin <70319226+linyueqian@users.noreply.github.com>

teith requested a review from hsliuustc0106 as a code owner March 26, 2026 00:39

linyueqian approved these changes Mar 26, 2026

View reviewed changes

linyueqian added the ready label to trigger buildkite CI label Mar 26, 2026

teith force-pushed the fix/qwen3-tts-missing-ref-text-crash branch 2 times, most recently from 152f1b6 to 1316136 Compare March 27, 2026 03:43

lishunyang12 reviewed Apr 2, 2026

View reviewed changes

Comment thread vllm_omni/model_executor/models/qwen3_tts/qwen3_tts_talker.py

hsliuustc0106 mentioned this pull request Apr 2, 2026

[Enhancement] Engine runtime errors #2426

Merged

5 tasks

linyueqian force-pushed the fix/qwen3-tts-missing-ref-text-crash branch 2 times, most recently from 2eabe77 to 85325d8 Compare April 11, 2026 02:16

linyueqian force-pushed the fix/qwen3-tts-missing-ref-text-crash branch from 85325d8 to a0cd54c Compare April 11, 2026 02:17

linyueqian enabled auto-merge (squash) April 11, 2026 02:17

linyueqian merged commit 001f2e3 into vllm-project:main Apr 11, 2026
7 of 8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG FIX]: prevent EngineCore crash when Qwen TTS Base task is missing ref_text#2203

[BUG FIX]: prevent EngineCore crash when Qwen TTS Base task is missing ref_text#2203
linyueqian merged 1 commit into
vllm-project:mainfrom
teith:fix/qwen3-tts-missing-ref-text-crash

teith commented Mar 26, 2026 •

edited

Loading

Uh oh!

yenuo26 commented Mar 26, 2026

Uh oh!

linyueqian commented Mar 26, 2026

Uh oh!

linyueqian left a comment

Uh oh!

linyueqian commented Mar 26, 2026

Uh oh!

linyueqian commented Mar 27, 2026

Uh oh!

lishunyang12 left a comment

Uh oh!

Uh oh!

hsliuustc0106 commented Apr 2, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

teith commented Mar 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

yenuo26 commented Mar 26, 2026

Uh oh!

linyueqian commented Mar 26, 2026

Uh oh!

linyueqian left a comment

Choose a reason for hiding this comment

Uh oh!

linyueqian commented Mar 26, 2026

Uh oh!

linyueqian commented Mar 27, 2026

Uh oh!

lishunyang12 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

hsliuustc0106 commented Apr 2, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

teith commented Mar 26, 2026 •

edited

Loading