[CI/BugFix] Fix Flaky Test for Qwen Omni Perf by alex-jw-brooks · Pull Request #2754 · vllm-project/vllm-omni

alex-jw-brooks · 2026-04-13T22:35:52Z

Purpose

Fixes the flaky test in the build linked here: #2752 #2389

Streaming requests in vLLM / vLLM Omni follow SSE specification. Since we largely send data, this mostly means that we are sending things like:

b'data: {JSON}\n\n'

Importantly, the space after the : does matter. In our performance script, we are currently .strip() ing all incoming chunks. This is the underlying cause of the erratic CI failures, because higher concurrency in streaming requests can lead to situations like this:

chunk1: b'data: '
chunk2: b'{JSON}\n\n'

When we encounter this case, add_chunks on the handler will add the stripped messages, i.e., giving data:{JSON}\n\n. As a result chunk = message.removeprefix("data: ") later in our script doesn't do anything, and it tries to decode the JSON with the data: in front, which causes the parsing error.

Reproducing it is a bit difficult, but I did log one of the failed requests out and did see the leading data: on it when decoding failed. The best way to repro is likely with a higher concurrency config, e.g., running pytest tests/dfx/perf/scripts/run_benchmark.py -s with the config path set to point to something like below:

[
    {
        "test_name": "test_qwen3_omni_chunk_stress",
        "server_params": {
            "model": "Qwen/Qwen3-Omni-30B-A3B-Instruct",
            "stage_config_name": "qwen3_omni.yaml",
            "update": {
                "async_chunk": true,
                "stage_args": {
                    "0": {
                        "engine_args.custom_process_next_stage_input_func": "vllm_omni.model_executor.stage_input_processors.qwen3_omni.thinker2talker_async_chunk"
                    },
                    "1": {
                        "engine_args.custom_process_next_stage_input_func": "vllm_omni.model_executor.stage_input_processors.qwen3_omni.talker2code2wav_async_chunk"
                    }
                }
            },
            "delete": {
                "stage_args": {
                    "2": [
                        "custom_process_input_func"
                    ]
                }
            }
        },
        "benchmark_params": [
            {
                "dataset_name": "random",
                "backend": "openai-chat-omni",
                "endpoint": "/v1/chat/completions",
                "num_prompts": 500,
                "max_concurrency": 32,
                "random_input_len": 100,
                "random_output_len": 100,
                "ignore_eos": true,
                "percentile-metrics": "ttft,tpot,itl,e2el,audio_rtf,audio_ttfp,audio_duration",
                "baseline": {
                    "mean_ttft_ms": 10000,
                    "mean_audio_ttfp_ms": 10000,
                    "mean_audio_rtf": 1.0
                }
            }
        ]
    }
]

CC @tzhouam

Signed-off-by: Alex Brooks <albrooks@redhat.com>

chatgpt-codex-connector · 2026-04-13T22:35:56Z

Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits.
Credits must be used to enable repository wide code reviews.

amy-why-3459 · 2026-04-14T01:01:54Z

#2389
Thank you so much for your fix. I believe your fix will also resolve this issue.

hsliuustc0106 · 2026-04-14T09:36:55Z

BLOCKER scan:

Correctness: PASS
Reliability/Safety: PASS
Breaking Changes: PASS
Test Coverage: PASS (CI tests verify)
Documentation: PASS
Security: PASS

OVERALL: NO BLOCKERS

VERDICT: COMMENT

Good catch on the TCP fragmentation issue. SSE parsing requires exact handling of whitespace - stripping can break the protocol. The comment explaining the issue is clear and helpful.

yenuo26 · 2026-04-14T11:48:59Z

@Gaohan123 @gcanlin @princepride Please help review whether this is ready to be merged.

Gaohan123

LGTM. Thanks!

Signed-off-by: Alex Brooks <albrooks@redhat.com>

fix wrong benchmark strip

c195084

Signed-off-by: Alex Brooks <albrooks@redhat.com>

alex-jw-brooks requested a review from hsliuustc0106 as a code owner April 13, 2026 22:35

yenuo26 added nightly-test label to trigger buildkite nightly test CI ready label to trigger buildkite CI and removed nightly-test label to trigger buildkite nightly test CI labels Apr 14, 2026

Gaohan123 approved these changes Apr 14, 2026

View reviewed changes

Gaohan123 merged commit cf1fcd5 into vllm-project:main Apr 14, 2026
8 checks passed

Gaohan123 mentioned this pull request Apr 14, 2026

[CI Failure]: Omni Model Perf Test，when sending 100 requests, an occasional single request fails. #2389

Closed

1 task

y123456y78 pushed a commit to y123456y78/vllm-omni that referenced this pull request Apr 15, 2026

[CI/BugFix] Fix Flaky Test for Qwen Omni Perf (vllm-project#2754)

d42e152

Signed-off-by: Alex Brooks <albrooks@redhat.com>

lvliang-intel pushed a commit to lvliang-intel/vllm-omni that referenced this pull request Apr 20, 2026

[CI/BugFix] Fix Flaky Test for Qwen Omni Perf (vllm-project#2754)

d80f5ae

Signed-off-by: Alex Brooks <albrooks@redhat.com>

lengrongfu pushed a commit to lengrongfu/vllm-omni that referenced this pull request May 1, 2026

[CI/BugFix] Fix Flaky Test for Qwen Omni Perf (vllm-project#2754)

d5e2739

Signed-off-by: Alex Brooks <albrooks@redhat.com>

clodaghwalsh17 pushed a commit to clodaghwalsh17/nm-vllm-omni-ent that referenced this pull request May 12, 2026

[CI/BugFix] Fix Flaky Test for Qwen Omni Perf (vllm-project#2754)

6513557

Signed-off-by: Alex Brooks <albrooks@redhat.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CI/BugFix] Fix Flaky Test for Qwen Omni Perf#2754

[CI/BugFix] Fix Flaky Test for Qwen Omni Perf#2754
Gaohan123 merged 1 commit into
vllm-project:mainfrom
alex-jw-brooks:fix_stream_parse

alex-jw-brooks commented Apr 13, 2026 •

edited

Loading

Uh oh!

chatgpt-codex-connector Bot commented Apr 13, 2026

Uh oh!

amy-why-3459 commented Apr 14, 2026

Uh oh!

hsliuustc0106 commented Apr 14, 2026

Uh oh!

yenuo26 commented Apr 14, 2026

Uh oh!

Gaohan123 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

alex-jw-brooks commented Apr 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Uh oh!

chatgpt-codex-connector Bot commented Apr 13, 2026

Uh oh!

amy-why-3459 commented Apr 14, 2026

Uh oh!

hsliuustc0106 commented Apr 14, 2026

Uh oh!

yenuo26 commented Apr 14, 2026

Uh oh!

Gaohan123 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

alex-jw-brooks commented Apr 13, 2026 •

edited

Loading