[Test] L4 complete diffusion feature test for Bagel models by NumberWan · Pull Request #1938 · vllm-project/vllm-omni

NumberWan · 2026-03-17T02:35:10Z


Purpose

This PR adds L4 tests for the Bagel model (ByteDance-Seed/BAGEL-7B-MoT),
covering all acceleration features currently required by the Model×Feature RFC for Bagel.

The nightly job selects these tests with: test_*_expansion.py -m 'advanced_model and diffusion and H100'.
They are intended to run in the L4 nightly diffusion pipeline described in RFC #1832.
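To make the marker selection concrete, here is a deliberately simplified model of how an `and`-only `-m` expression filters tests. It is a sketch, not pytest's implementation (pytest also supports `or`, `not`, and parentheses), and the marker sets below are hypothetical examples rather than values read from the test file.

```python
# Simplified model of `pytest -m` selection.
# Assumption: only "and"-joined marker names are handled here.
def is_selected(marker_expr: str, test_markers: set[str]) -> bool:
    required = [term.strip() for term in marker_expr.split(" and ")]
    return all(term in test_markers for term in required)


NIGHTLY_EXPR = "advanced_model and diffusion and H100"

# Hypothetical marker sets, for illustration only.
bagel_l4_markers = {"advanced_model", "diffusion", "H100"}
diffusion_only_markers = {"diffusion"}

print(is_selected(NIGHTLY_EXPR, bagel_l4_markers))        # True
print(is_selected(NIGHTLY_EXPR, diffusion_only_markers))  # False
print(is_selected("diffusion", bagel_l4_markers))         # True
```

Under this model, a test must carry all three markers to be picked up by the nightly expression, while the shorter `-m diffusion` filter used in the first collection run matches any test marked `diffusion`.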

Test Plan

The diffusion acceleration doc / #1217 states that Bagel supports the following features:

TeaCache
Cache-DiT
CFG-Parallel
Tensor-Parallel

Test Coverage for Bagel (L4, online_serving)

New test file

tests/e2e/online_serving/test_bagel_expansion.py

Model

ByteDance-Seed/BAGEL-7B-MoT

Cases

| Test ID | Feature | Server args (minimal for that feature) |
| --- | --- | --- |
| `single_card_teacache` | TeaCache | `--cache-backend tea_cache` (1 GPU) |
| `single_card_cache_dit` | Cache-DiT | `--cache-backend cache_dit` (1 GPU) |
| `parallel_cfg_2` | CFG-Parallel size 2 | `--cache-backend tea_cache --cfg-parallel-size 2` (2 GPUs) |
| `parallel_tp_2` | Tensor-Parallel size 2 | `--cache-backend cache_dit --tensor-parallel-size 2` (2 GPUs) |
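The case table above can be expressed as plain data, which is roughly the shape a parametrized test would consume. This is a minimal sketch and not the contents of `test_bagel_expansion.py`: the `BagelCase` class and the `runnable_cases` helper are hypothetical names introduced for illustration, while the test IDs, flags, and GPU counts are copied from the table.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BagelCase:
    """One row of the case table (hypothetical container, not from the PR)."""
    test_id: str
    feature: str
    server_args: tuple[str, ...]
    num_gpus: int


CASES = [
    BagelCase("single_card_teacache", "TeaCache",
              ("--cache-backend", "tea_cache"), 1),
    BagelCase("single_card_cache_dit", "Cache-DiT",
              ("--cache-backend", "cache_dit"), 1),
    BagelCase("parallel_cfg_2", "CFG-Parallel size 2",
              ("--cache-backend", "tea_cache", "--cfg-parallel-size", "2"), 2),
    BagelCase("parallel_tp_2", "Tensor-Parallel size 2",
              ("--cache-backend", "cache_dit", "--tensor-parallel-size", "2"), 2),
]


def runnable_cases(available_gpus: int) -> list[str]:
    """Return the IDs of the cases that fit on the given number of GPUs."""
    return [case.test_id for case in CASES if case.num_gpus <= available_gpus]


print(runnable_cases(1))  # ['single_card_teacache', 'single_card_cache_dit']
print(runnable_cases(2))  # all four test IDs
```

Keeping the GPU count next to the server arguments makes it easy to see that the two parallel cases each pair a parallelism flag with one of the two cache backends.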

Test Result

pytest tests/e2e/online_serving/test_bagel_expansion.py --collect-only -m diffusion

tests/e2e/online_serving/test_bagel_expansion.py --collect-only -m diffusion
==================================== test session starts =====================================
platform linux -- Python 3.12.3, pytest-9.0.2, pluggy-1.6.0
rootdir: /home/w00917303/vllm-omni
configfile: pyproject.toml
plugins: anyio-4.12.1
collected 4 items                                                                            

<Dir vllm-omni>
  <Package tests>
    <Package e2e>
      <Package online_serving>
        <Module test_bagel_expansion.py>
          <Function test_bagel[single_card_teacache]>
          <Function test_bagel[single_card_cache_dit]>
          <Function test_bagel[parallel_cfg_2]>
          <Function test_bagel[parallel_tp_2]>

Additional Test Result

Command:

pytest -s -v tests/e2e/online_serving/test_bagel_expansion.py \
 -m "diffusion and advanced_model and H100" \
 --collect-only

collected 4 items
<Dir vllm-omni>
 <Package tests>
   <Package e2e>
     <Package online_serving>
       <Module test_bagel_expansion.py>
         L4 diffusion feature expansion tests for Bagel.
         Coverage:
         - TeaCache
         - Cache-DiT
         - CFG-Parallel
         - Tensor-Parallel
         <Function test_bagel[single_card_teacache]>
         <Function test_bagel[single_card_cache_dit]>
         <Function test_bagel[parallel_cfg_2]>
         <Function test_bagel[parallel_tp_2]>

CI Test Result

CI passed for all 4 tests:

Running test: test_bagel[single_card_teacache]
Running test: test_bagel[single_card_cache_dit]
Running test: test_bagel[parallel_cfg_2]
Running test: test_bagel[parallel_tp_2]


Signed-off-by: NumberWan <wantszkin2003@gmail.com>

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 1ca862ce99


NumberWan · 2026-03-17T03:47:15Z

@congw729 @yenuo26 PTAL

hsliuustc0106 · 2026-03-17T05:12:38Z

@princepride PTAL

hsliuustc0106 · 2026-03-17T05:14:06Z

@@ -0,0 +1,129 @@
+"""L4 diffusion feature expansion tests for Bagel.


what's the expected result for these tests?

The expected result is that for each configuration (TeaCache / Cache-DiT / CFG-Parallel / TP2), Bagel can successfully generate images via online serving, and assert_diffusion_response verifies that:

images are generated without errors

the number of images equals num_outputs_per_prompt

the resolution matches the height=512, width=512 set in extra_body.

At the L4 level we also use these cases in the nightly pipeline to monitor e2e latency under each feature combination.

An additional comment was added at the top of the file to clarify the use of the file.
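The three checks listed in this reply (no errors, image count, resolution) could be sketched as follows. This is not the real `assert_diffusion_response` helper, whose implementation is not shown in this thread; `check_images`, `png_size`, and `fake_png_header` are hypothetical names, and the sketch assumes the server returns images as base64-encoded PNG data.

```python
import base64
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def png_size(png_bytes: bytes) -> tuple[int, int]:
    """Read (width, height) from the IHDR chunk at the start of a PNG."""
    if png_bytes[:8] != PNG_SIGNATURE or png_bytes[12:16] != b"IHDR":
        raise ValueError("not a PNG image")
    width, height = struct.unpack(">II", png_bytes[16:24])
    return width, height


def check_images(images_b64, num_outputs_per_prompt, height, width):
    """Shape-only validation: image count and per-image resolution."""
    assert len(images_b64) == num_outputs_per_prompt, "unexpected image count"
    for item in images_b64:
        actual_width, actual_height = png_size(base64.b64decode(item))
        assert (actual_width, actual_height) == (width, height), "resolution mismatch"


def fake_png_header(width: int, height: int) -> bytes:
    """Build just enough of a PNG (signature + IHDR start) for png_size."""
    return (PNG_SIGNATURE + struct.pack(">I", 13) + b"IHDR"
            + struct.pack(">II", width, height))


# Usage with a synthetic 512x512 header standing in for a generated image.
sample = base64.b64encode(fake_png_header(512, 512)).decode()
check_images([sample], num_outputs_per_prompt=1, height=512, width=512)
print("shape checks passed")
```

A check of this kind confirms that generation succeeded with the requested shape, but it says nothing about pixel content.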

princepride · 2026-03-17T05:22:11Z

We need to compare some specific pixel values to ensure these features produce the expected result, so you can refer to Bagel's e2e test. I also think using only OmniDiffusion is enough to test these features.

Thanks for the suggestion!
However, according to the internal agreement:

If a diffusion feature is supported in online serving, prefer to test it via online serving unless it only supports offline serving.

At the L4 e2e level we currently only validate that images/videos are generated with the expected shape, and record e2e latency.

This Bagel L4 suite follows exactly the same pattern/infra as test_qwen_image_edit_expansion.py and #1682 to keep the behavior consistent across models.

#1832 This is the RFC @princepride

Thank you, I will check it later


I think maybe you can unify the bagel test cases in the current test-ready.yml into test_bagel.py, and use the current code style

@princepride Thanks! This PR follows the current L4 diffusion e2e scope in RFC #1832 / template #1682 (online serving, shape checks). If this is acceptable, could you please dismiss the “changes requested” review so this PR can merge?

yenuo26 · 2026-03-17T06:41:26Z

I think you should use pytest -s -v tests/e2e/online_serving/test_*_expansion.py -m "advanced_model and diffusion and H100" --collect-only to verify whether the test cases can be collected.
Also, could you attach the local execution results?

NumberWan · 2026-03-17T07:54:10Z

I think you should use pytest -s -v tests/e2e/online_serving/test_*_expansion.py -m "advanced_model and diffusion and H100" --collect-only to verify whether the test cases can be collected. Also, could you attach the local execution results?

Thank you, I just added the test result according to the command you provided. The test result is in the "Additional Test Result" section.

NumberWan · 2026-03-19T01:54:22Z

@hsliuustc0106 @Gaohan123 All requested CI tests have passed (the corresponding results are in the "Test Result" section). Could you please take a look so this can be merged?




NumberWan added 2 commits March 17, 2026 10:03

[Test] L4 complete diffusion feature test for Bagel models

e0d9cb4

Signed-off-by: NumberWan <wantszkin2003@gmail.com>

Fixed format

1ca862c

Signed-off-by: NumberWan <wantszkin2003@gmail.com>

yenuo26 mentioned this pull request Mar 17, 2026

[RFC]: Supplement use cases for L1, L3, and L4 JiusiServe/vllm-omni#163

Closed

1 task

NumberWan marked this pull request as ready for review March 17, 2026 03:18

Fixed format

2b6c223

Signed-off-by: NumberWan <wantszkin2003@gmail.com>

chatgpt-codex-connector Bot reviewed Mar 17, 2026


Comment thread tests/e2e/online_serving/test_bagel_expansion.py

hsliuustc0106 reviewed Mar 17, 2026


princepride requested changes Mar 17, 2026


Edited Comments

ad0c6b2

Signed-off-by: NumberWan <wantszkin2003@gmail.com>

Edited Comments

fb78bb4

Signed-off-by: NumberWan <wantszkin2003@gmail.com>

Edited Comments

752d2f2

Signed-off-by: NumberWan <wantszkin2003@gmail.com>

NumberWan requested review from hsliuustc0106 and princepride March 17, 2026 08:55

Changes for the Temp CI test

321c634

Signed-off-by: NumberWan <wantszkin2003@gmail.com>

hsliuustc0106 added the ready label to trigger buildkite CI label Mar 18, 2026

hsliuustc0106 reviewed Mar 18, 2026


Comment thread .buildkite/pipeline.yml Outdated

Comment thread .buildkite/test-nightly.yml Outdated

Revert "Changes for the Temp CI test"

01a6af2

This reverts commit 321c634. Signed-off-by: NumberWan <wantszkin2003@gmail.com>

NumberWan force-pushed the bagel_example branch from 22d6dd0 to 01a6af2 Compare March 18, 2026 07:49

Merge branch 'main' into bagel_example

27addc4

yenuo26 mentioned this pull request Mar 19, 2026

[CI] Split BAGEL tests into dummy/real weight tiers (L2/L3) #1998

Merged

princepride approved these changes Mar 19, 2026


princepride merged commit ae92ef5 into vllm-project:main Mar 19, 2026
7 checks passed

fhfuih pushed a commit to fhfuih/vllm-omni that referenced this pull request Mar 19, 2026

[Test] L4 complete diffusion feature test for Bagel models (vllm-proj…

ea17afc

…ect#1938) Signed-off-by: NumberWan <wantszkin2003@gmail.com>

Hu1Lcode pushed a commit to Hu1Lcode/vllm-omni that referenced this pull request Mar 19, 2026

[Test] L4 complete diffusion feature test for Bagel models (vllm-proj…

3752b06

…ect#1938) Signed-off-by: NumberWan <wantszkin2003@gmail.com> Signed-off-by: Hui <1779066624@qq.com>

fhfuih mentioned this pull request Mar 20, 2026

[RFC]: L4 e2e tests of diffusion models and diffusion features (continuous maintanance) #1832

Open

1 task

yiliu30 pushed a commit to yiliu30/vllm-omni-fork that referenced this pull request Mar 20, 2026

[Test] L4 complete diffusion feature test for Bagel models (vllm-proj…

a2db577

…ect#1938) Signed-off-by: NumberWan <wantszkin2003@gmail.com> Signed-off-by: yiliu30 <yi4.liu@intel.com>

clodaghwalsh17 pushed a commit to clodaghwalsh17/nm-vllm-omni-ent that referenced this pull request May 12, 2026

[Test] L4 complete diffusion feature test for Bagel models (vllm-proj…

26090ab

…ect#1938) Signed-off-by: NumberWan <wantszkin2003@gmail.com>

