[Test] Add qwen3-omni tests for audio_in_video and one word prompt #2097

Merged: hsliuustc0106 merged 19 commits into vllm-project:main from yenuo26:audio_in_video on Mar 31, 2026

yenuo26 (Collaborator) commented Mar 23, 2026

Purpose

Add qwen3-omni tests for audio_in_video and one word prompt

Test Plan

1. Run in local env

/workspace/.venv/bin/python -m pytest -s -v tests/e2e/online_serving/test_qwen3_omni_expansion.py -k "test_audio_in_video_001" -m "advanced_model" --run-level "advanced_model"

2. Run in CI

Test Result

1. Local

==== 1 passed, 18 warnings in 132.98s (0:02:12) ====

2. CI
bc2d3920-677e-4a33-acf9-e1310e6a7724



Signed-off-by: yenuo26 <410167048@qq.com>
yenuo26 (Collaborator, Author) commented Mar 23, 2026

@Shirley125 @amy-why-3459 PTAL

chatgpt-codex-connector (Bot) left a comment

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 42387a32de


Comment thread tests/e2e/online_serving/test_qwen3_omni_expansion.py Outdated
…d increasing max tokens in CI configuration

Signed-off-by: yenuo26 <410167048@qq.com>
amy-why-3459 (Contributor) commented

Please add a multi-concurrency precision test case for audio_in_video.

yenuo26 requested a review from hsliuustc0106 as a code owner on March 23, 2026 12:36

yenuo26 (Collaborator, Author) commented Mar 23, 2026

> Please add a multi-concurrency precision test case for audio_in_video.

Done.

yenuo26 (Collaborator, Author) commented Mar 23, 2026

@Gaohan123 @tzhouam please help add the ready label

hsliuustc0106 added the "ready" label (label to trigger buildkite CI) on Mar 23, 2026

hsliuustc0106 (Collaborator) left a comment

BLOCKER scan:

  • Correctness: ISSUES: test_audio_in_video_001/002 still do not assert on audio-derived content, so the new use_audio_in_video path can break while the tests continue to pass by describing only the video.
  • Reliability/Safety: ISSUES: .buildkite/test-ready.yml now hard-switches the H100 job to an advanced expansion test with an in-file note saying it is for debug and should be removed before merge. That changes the repo's normal CI signal rather than adding stable coverage.
  • Breaking Changes: PASS
  • Test Coverage: needs tests/evidence that specifically fail when embedded audio is ignored.
  • Documentation: PASS (test-only PR)
  • Security: PASS

OVERALL: 2 BLOCKERS FOUND

VERDICT: REQUEST_CHANGES

I validated the new use_audio_in_video wiring in send_omni_request() and the new expansion tests. The remaining blockers are that the assertions still don't depend on extracted audio content, and the Buildkite change is explicitly checked in as a temporary debug path instead of stable CI coverage.
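The first blocker asks for an assertion that fails when the embedded audio is ignored. A minimal sketch of that idea follows, assuming the clip contains words that are spoken but never visible on screen. The helper names and the example keywords are hypothetical and are not taken from the PR's test file.

```python
def missing_audio_keywords(response_text: str, audio_keywords: list[str]) -> list[str]:
    """Return the audio-only keywords that do not appear in the response."""
    lowered = response_text.lower()
    return [kw for kw in audio_keywords if kw.lower() not in lowered]


def assert_mentions_audio(response_text: str, audio_keywords: list[str]) -> None:
    """Fail unless every audio-only keyword shows up in the response."""
    missing = missing_audio_keywords(response_text, audio_keywords)
    assert not missing, f"response ignores audio content, missing: {missing}"


# Hypothetical example: words spoken in the clip but never shown visually.
keywords = ["weather", "forecast"]
assert_mentions_audio("The speaker reads the weather forecast.", keywords)
```

A reply that only describes the visuals, such as "A man stands in a room.", would fail this check, which is the property the review asks for.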

Comment thread .buildkite/test-ready.yml Outdated
timeout 20m bash -c '
export VLLM_WORKER_MULTIPROC_METHOD=spawn
export VLLM_TEST_CLEAN_GPU_MEMORY="1"
#pytest -s -v tests/e2e/online_serving/test_qwen3_omni.py -m "core_model" --run-level "core_model"
Collaborator

This swaps the normal H100 omni test job over to test_qwen3_omni_expansion.py and even leaves a "for debug, will be removed before merging" note in the committed pipeline. That means the PR is changing repository CI coverage in a temporary, debug-only way instead of adding a stable test signal. Could you revert the debug pipeline override and keep the new coverage in a dedicated test/job if it needs CI coverage?

Collaborator, Author

Removed.

hsliuustc0106 (Collaborator) commented

Review Summary

Verdict: ✅ LGTM (No Blockers)

Test-only PR for Qwen3-omni edge cases. Good coverage expansion!

Test Quality Check

Test correctness: Tests use standard omni_server + openai_client fixtures
Coverage: Tests 3 scenarios:

  • test_audio_in_video_001 - audio-in-video input
  • test_audio_in_video_002 - different audio-in-video config
  • test_one_word_prompt_001 - minimal text input edge case

Test evidence: PR shows 1 passed in 132.98s

Non-blocking Observations

conftest.py: Large refactoring (121 additions, 32 deletions) - test infrastructure improvements. Not reviewing in detail since it's test infrastructure.

test-ready.yml: Modified CI config (39 additions, 38 deletions) - appears to be test scheduling changes.


Ready to merge.

yenuo26 and others added 6 commits March 24, 2026 09:35
…ken configuration

Signed-off-by: yenuo26 <410167048@qq.com>
Signed-off-by: yenuo26 <410167048@qq.com>
…to include audio in the output MP4. Update tests to utilize the new feature and adjust CI configuration to reduce `max_tokens` for improved performance.

Signed-off-by: yenuo26 <410167048@qq.com>
Signed-off-by: wangyu <410167048@qq.com>
Gaohan123 (Collaborator) commented

Please fix docs

yenuo26 (Collaborator, Author) commented Mar 25, 2026

> Please fix docs

Actually, I didn't modify any documentation. I will retry it.

Gaohan123 (Collaborator) commented

@yenuo26 Please resolve conflicts

yenuo26 (Collaborator, Author) commented Mar 25, 2026

> @yenuo26 Please resolve conflicts

There are currently no conflicts to resolve.

openai_client.send_omni_request(request_config, request_num=get_max_batch_size())


@pytest.mark.skip(reason="There is a known issue: https://github.com/vllm-project/vllm-omni/pull/2019")
Contributor

PR #2019 has been merged. Please remove the skip option and test whether the test cases pass.

Collaborator, Author

During local testing, I found that the error still occurs when the requests are sent concurrently.

Contributor

OK, I will continue to follow up on this issue and will remove the skip flag once it is resolved.
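Since the failure is reported to appear only under concurrent calls, a small harness that fans out requests and collects every error can help reproduce it. This is a sketch under assumptions: `run_concurrently` and `fake_send` are hypothetical names, and `fake_send` stands in for a single request to the server rather than the real `send_omni_request` path.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable


def run_concurrently(
    send_one: Callable[[int], str], request_num: int
) -> tuple[list[str], list[Exception]]:
    """Call send_one(i) for i in range(request_num) in parallel threads.

    Successful results and exceptions are returned separately, so a failure
    that only appears under concurrency is reported instead of swallowed.
    """
    results: list[str] = []
    errors: list[Exception] = []
    with ThreadPoolExecutor(max_workers=request_num) as pool:
        futures = [pool.submit(send_one, i) for i in range(request_num)]
        for future in futures:
            try:
                results.append(future.result())
            except Exception as exc:  # collect, do not stop at the first failure
                errors.append(exc)
    return results, errors


def fake_send(i: int) -> str:
    """Hypothetical stand-in for one request to the server."""
    return f"response-{i}"


results, errors = run_concurrently(fake_send, request_num=4)
assert not errors and len(results) == 4
```

Running the same callable with `request_num=1` and then with a larger value separates errors that are inherent to the request from those that only show up when calls overlap.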

yenuo26 (Collaborator, Author) commented Mar 26, 2026

@Gaohan123 @david6666666 please help add the nightly-test label

congw729 added the "nightly-test" label (label to trigger buildkite nightly test CI) on Mar 26, 2026

yenuo26 (Collaborator, Author) commented Mar 26, 2026

Can this be merged? I've already run L2+L4. The vllm-omni CI failed because the Diffusion L4 performance test didn't pass. @Gaohan123

yenuo26 added 3 commits March 27, 2026 10:18
…ecision for unique timestamps, preventing file overwrites during concurrent calls.

Signed-off-by: wangyu <410167048@qq.com>
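The commit message above is truncated, so the exact change is not known. As a hedged illustration of the general technique (timestamps fine enough to avoid overwrites under concurrent calls), the sketch below assumes microsecond precision and adds a random suffix. The function name, the prefix, and the `.mp4` suffix are hypothetical.

```python
import uuid
from datetime import datetime
from pathlib import Path


def unique_output_path(directory: Path, prefix: str = "output", suffix: str = ".mp4") -> Path:
    """Build a file path that is very unlikely to collide across concurrent calls."""
    # %f appends microseconds, so calls within the same second still differ.
    stamp = datetime.now().strftime("%Y%m%d_%H%M%S_%f")
    # The random fragment covers the rare case of two calls in the same microsecond.
    return directory / f"{prefix}_{stamp}_{uuid.uuid4().hex[:12]}{suffix}"


# Hypothetical usage: two back-to-back calls produce different paths.
first = unique_output_path(Path("/tmp"))
second = unique_output_path(Path("/tmp"))
assert first != second
```

A timestamp alone can still collide when several workers write at once, which is why the sketch pairs it with a random component.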
…d checks for similarity and normalized matching against audio_transcript_key_words. Updated test cases to reflect new requirements.

Signed-off-by: wangyu <410167048@qq.com>
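The commit above mentions similarity checks and normalized matching against `audio_transcript_key_words`. How the PR implements this is not shown in the conversation; the sketch below is one possible approach that uses `difflib.SequenceMatcher` from the standard library. The function names and the 0.8 threshold are assumptions made for illustration.

```python
import re
from difflib import SequenceMatcher


def normalize(text: str) -> str:
    """Lowercase, replace punctuation with spaces, and collapse whitespace."""
    text = re.sub(r"[^\w\s]", " ", text.lower())
    return " ".join(text.split())


def keyword_matches(transcript: str, keyword: str, threshold: float = 0.8) -> bool:
    """True if the keyword appears in the transcript exactly or approximately."""
    norm_transcript = normalize(transcript)
    norm_keyword = normalize(keyword)
    if norm_keyword in norm_transcript:
        return True
    # Fall back to a fuzzy comparison against each window of the same word count.
    words = norm_transcript.split()
    size = len(norm_keyword.split())
    for start in range(len(words) - size + 1):
        window = " ".join(words[start:start + size])
        if SequenceMatcher(None, window, norm_keyword).ratio() >= threshold:
            return True
    return False


# Hypothetical usage: punctuation and case differences do not block a match.
assert keyword_matches("Hello, World!", "hello world")
```

Normalizing first keeps the check robust to casing and punctuation in the transcript, while the similarity fallback tolerates small transcription errors without accepting unrelated text.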
yenuo26 (Collaborator, Author) commented Mar 28, 2026

b38ebbd1-d69d-400e-bd9f-a16c7816facd

I only modified the test case in "Omni Model Test with H100", and it passed in the nightly test.
Please review whether this PR can be merged. @Gaohan123 @david6666666

gcanlin (Collaborator) left a comment

LGTM

gcanlin removed the "nightly-test" label (label to trigger buildkite nightly test CI) on Mar 28, 2026
gcanlin requested a review from hsliuustc0106 on March 28, 2026 11:10
Signed-off-by: wangyu <53896905+yenuo26@users.noreply.github.com>
hsliuustc0106 merged commit c1a978a into vllm-project:main on Mar 31, 2026
7 of 8 checks passed
yenuo26 deleted the audio_in_video branch on March 31, 2026 11:33
vraiti pushed a commit to vraiti/vllm-omni that referenced this pull request Apr 9, 2026
…llm-project#2097)

Signed-off-by: yenuo26 <410167048@qq.com>
Signed-off-by: wangyu <410167048@qq.com>
Signed-off-by: wangyu <53896905+yenuo26@users.noreply.github.com>
Co-authored-by: Alicia <115451386+congw729@users.noreply.github.com>
lengrongfu pushed a commit to lengrongfu/vllm-omni that referenced this pull request May 1, 2026
clodaghwalsh17 pushed a commit to clodaghwalsh17/nm-vllm-omni-ent that referenced this pull request May 12, 2026