[Test] Add voice or language test case for Qwen3-omni and Qwen-tts by yenuo26 · Pull Request #1844 · vllm-project/vllm-omni

yenuo26 · 2026-03-12T08:49:10Z

PLEASE FILL IN THE PR DESCRIPTION HERE ENSURING ALL CHECKLIST ITEMS (AT THE BOTTOM) HAVE BEEN CONSIDERED.

Purpose

1.Add speaker and language test cases for Qwen3-omni
2.Add voice case-sensitive test cases for Qwen-tts

Test Plan

Qwen3-omni

pytest -sv tests/e2e/online_serving/test_qwen3_omni_expansion.py -k "test_speaker or test_language" --run-level advanced_model

2.Qwen-tts

pytest -sv tests/e2e/online_serving/test_qwen3_tts_customvoice_expansion.py::test_voice_003 --run-level advanced_model

Test Result

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan. Please provide the test scripts & test commands. Please state the reasons if your codes don't require additional test scripts. For test file guidelines, please check the test style doc
The test results. Please paste the results comparison before and after, or the e2e results.
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model. Please run mkdocs serve to sync the documentation editions to ./docs.
(Optional) Release notes update. If your change is user-facing, please update the release notes draft.

BEFORE SUBMITTING, PLEASE READ https://github.com/vllm-project/vllm-omni/blob/main/CONTRIBUTING.md (anything written below this line will be removed by GitHub Actions)

- Added OpenCC dependency for text conversion in preprocess_text function. - Introduced functions to estimate voice gender from audio and validate against expected gender in assert_omni_response. - Refactored audio merging logic into a separate function for clarity. - Updated test cases to include voice gender specifications and improved parameter handling for model configurations. Signed-off-by: yenuo26 <410167048@qq.com>

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 16e54f14bd

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-03-12T08:54:13Z

+    if f0 > 170.0:
+        half = f0 / 2.0
+        if 75.0 <= half <= 185.0:
+            f0 = half


Avoid unconditional octave halving in gender estimator

This octave-correction branch halves any f0 > 170 when the halved value is in 75..185, which incorrectly maps most genuine female pitches (typically ~180–255 Hz) into the male range before classification. In practice, a correctly estimated female voice around 200–230 Hz gets forced to ~100–115 Hz and then labeled male, so the new female-voice validations can fail even when the model output is correct.

Useful? React with 👍 / 👎.

Signed-off-by: wangyu <410167048@qq.com>

…and Qwen3-omni Signed-off-by: wangyu <410167048@qq.com>

Signed-off-by: wangyu <410167048@qq.com>

yenuo26 · 2026-04-01T03:28:31Z

@congw729 please help to add nightly-test label

…gging and efficiency. Signed-off-by: wangyu <410167048@qq.com>

Signed-off-by: wangyu <53896905+yenuo26@users.noreply.github.com>

Signed-off-by: wangyu <410167048@qq.com>

yenuo26 · 2026-04-01T11:05:19Z

success in nightly-test:

success in merge-test:

The Bagel failure is a known issue and is being tracked in Issue #2416
Please review whether this PR can be merged. @gcanlin @Gaohan123

Signed-off-by: wangyu <410167048@qq.com>

gcanlin

LGTM

…llm-project#1844) Signed-off-by: yenuo26 <410167048@qq.com> Signed-off-by: wangyu <410167048@qq.com> Signed-off-by: wangyu <53896905+yenuo26@users.noreply.github.com>

yenuo26 requested a review from hsliuustc0106 as a code owner March 12, 2026 08:49

chatgpt-codex-connector Bot reviewed Mar 12, 2026

View reviewed changes

This was referenced Mar 12, 2026

[Bug]: Qwen3-omni, when I specify the voice as female in the system prompt, the output audio is still male. #1845

Closed

[Feature] support to change the speaker of qwen3-omni #1963

Merged

yenuo26 closed this Mar 31, 2026

yenuo26 deleted the voice branch March 31, 2026 11:31

yenuo26 restored the voice branch March 31, 2026 11:32

yenuo26 reopened this Mar 31, 2026

yenuo26 added 3 commits March 31, 2026 19:39

Merge remote-tracking branch 'upstream/main' into voice

0e65a98

Signed-off-by: wangyu <410167048@qq.com>

Simplify voice gender checks and enhance test coverage for Qwen3 TTS …

768b3fd

…and Qwen3-omni Signed-off-by: wangyu <410167048@qq.com>

Fix case sensitivity in keyword assertions for OmniResponse tests

7e1af22

Signed-off-by: wangyu <410167048@qq.com>

yenuo26 changed the title ~~[WIP][Test] Add qwen3-omni voice and language test case~~ [Test] Add voice or language test case for Qwen3-omni and Qwen-tts Mar 31, 2026

yenuo26 force-pushed the voice branch from 43b3897 to ea870d6 Compare April 1, 2026 02:14

modify voice

0543d96

Signed-off-by: wangyu <410167048@qq.com>

yenuo26 force-pushed the voice branch from ea870d6 to 0543d96 Compare April 1, 2026 02:58

modify prompt

2f43bbb

Signed-off-by: wangyu <410167048@qq.com>

congw729 added the nightly-test label to trigger buildkite nightly test CI label Apr 1, 2026

yenuo26 and others added 6 commits April 1, 2026 12:14

Consolidate pytest commands in CI configuration files for improved lo…

8ef9227

…gging and efficiency. Signed-off-by: wangyu <410167048@qq.com>

Merge branch 'main' into voice

95af2bc

Signed-off-by: wangyu <53896905+yenuo26@users.noreply.github.com>

Refactor CI pytest command to streamline test execution for core_model.

2d3f7f5

Signed-off-by: wangyu <410167048@qq.com>

Merge branch 'voice' of https://github.com/yenuo26/vllm-omni into voice

0c5c81e

Update IMAGE_KEY

952575b

Signed-off-by: wangyu <410167048@qq.com>

Merge branch 'main' into voice

0b67b01

yenuo26 mentioned this pull request Apr 1, 2026

[CI failure]: nightly Omni model test with H100 fails due to missing keywords #2415

Closed

1 task

congw729 added ready label to trigger buildkite CI and removed nightly-test label to trigger buildkite nightly test CI labels Apr 1, 2026

yenuo26 added 2 commits April 1, 2026 19:01

test merge ci

ae0ef18

Signed-off-by: wangyu <410167048@qq.com>

Merge branch 'voice' of https://github.com/yenuo26/vllm-omni into voice

13c464c

yenuo26 changed the title ~~[Test] Add voice or language test case for Qwen3-omni and Qwen-tts~~ [WIP][Test] Add voice or language test case for Qwen3-omni and Qwen-tts Apr 1, 2026

yenuo26 added 2 commits April 1, 2026 19:36

remove merge test

5b524e4

Signed-off-by: wangyu <410167048@qq.com>

remove merge test

4e800e8

Signed-off-by: wangyu <410167048@qq.com>

yenuo26 changed the title ~~[WIP][Test] Add voice or language test case for Qwen3-omni and Qwen-tts~~ [Test] Add voice or language test case for Qwen3-omni and Qwen-tts Apr 1, 2026

gcanlin approved these changes Apr 2, 2026

View reviewed changes

gcanlin merged commit 9c2a576 into vllm-project:main Apr 2, 2026
8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Test] Add voice or language test case for Qwen3-omni and Qwen-tts#1844

[Test] Add voice or language test case for Qwen3-omni and Qwen-tts#1844
gcanlin merged 16 commits intovllm-project:mainfrom
yenuo26:voice

yenuo26 commented Mar 12, 2026 •

edited

Loading

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot Mar 12, 2026

Uh oh!

yenuo26 commented Apr 1, 2026

Uh oh!

yenuo26 commented Apr 1, 2026 •

edited

Loading

Uh oh!

gcanlin left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

yenuo26 commented Mar 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

yenuo26 commented Apr 1, 2026

Uh oh!

yenuo26 commented Apr 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gcanlin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

yenuo26 commented Mar 12, 2026 •

edited

Loading

yenuo26 commented Apr 1, 2026 •

edited

Loading