[codex][release/v0.18.0.post1] revert Wan2.2 pipeline changes from #2878 by david6666666 · Pull Request #2937 · vllm-project/vllm-omni

david6666666 · 2026-04-20T08:38:03Z

Summary

revert the Wan2.2 pipeline behavior that was backported through #2878
keep the later #2854 release-branch optimization in pipeline_wan2_2_i2v.py
drop the Wan2.2 max-sequence regression test that only covered the reverted behavior

What Changed

restore pipeline_wan2_2.py, pipeline_wan2_2_i2v.py, and pipeline_wan2_2_ti2v.py to the pre-#2878 prompt handling behavior
keep the later image preprocess / mask cleanup from 8a1ff4e9 in pipeline_wan2_2_i2v.py
remove the now-obsolete Wan2.2 max-sequence test added by #2878
keep the repo-wide formatting / unused-import cleanup required by pre-commit

Validation

python -m py_compile vllm_omni/diffusion/models/wan2_2/pipeline_wan2_2.py vllm_omni/diffusion/models/wan2_2/pipeline_wan2_2_i2v.py vllm_omni/diffusion/models/wan2_2/pipeline_wan2_2_ti2v.py
pre-commit run --all-files
E2E serve + /v1/videos re-validation on 4x H20 with the same local environment style used in earlier #2878 comments

E2E Result

Fixed setup for both runs:

model snapshot: /mnt/data1/huggingface/hub/models--Wan-AI--Wan2.2-I2V-A14B-Diffusers/snapshots/596658fd9ca6b7b71d5057529bbf319ecbc61d74
CUDA_VISIBLE_DEVICES=4,5,6,7
--omni --enable-diffusion-pipeline-profiler --ulysses-degree 4
prompt: A white rabbit standing on a wooden table, then slowly turning its head and hopping forward with smooth motion.
size=1280x720, seconds=5, fps=16, num_frames=81, num_inference_steps=4, guidance_scale=3.5, guidance_scale_2=3.5, boundary_ratio=0.875, flow_shift=5.0, seed=42, frame interpolation disabled
fixed input image: /mnt/data4/cwq/tmp/rabbit_real.png

Measured comparison against current release/v0.18.0.post1 head:

release head: server_inference_time_s=113.73233714140952, artifact_ready_wall_s=114.78
this revert branch: server_inference_time_s=113.81139127537608, artifact_ready_wall_s=114.856
text_encoder.forward total from profiler log: 0.047962s -> 0.120787s
DiT step wall time from tqdm: 23.42s/it -> 23.50s/it

Interpretation:

reverting the Wan2.2 prompt-length backport brings text_encoder.forward back to the earlier higher-cost path
overall E2E latency stays nearly flat for this 4-step request, but the revert is slightly slower rather than faster
both outputs keep the same video metadata: 1280x720, 81 frames, 16 fps, 5.0625s

Signed-off-by: david6666666 <530634352@qq.com>

david6666666 · 2026-04-20T08:38:22Z

Supplementary E2E re-validation for this revert.

Setup stayed fixed across both runs:

model snapshot: /mnt/data1/huggingface/hub/models--Wan-AI--Wan2.2-I2V-A14B-Diffusers/snapshots/596658fd9ca6b7b71d5057529bbf319ecbc61d74
CUDA_VISIBLE_DEVICES=4,5,6,7
--omni --enable-diffusion-pipeline-profiler --ulysses-degree 4
prompt: A white rabbit standing on a wooden table, then slowly turning its head and hopping forward with smooth motion.
size=1280x720, seconds=5, fps=16, num_frames=81, num_inference_steps=4, guidance_scale=3.5, guidance_scale_2=3.5, boundary_ratio=0.875, flow_shift=5.0, seed=42, frame interpolation disabled
fixed input image: /mnt/data4/cwq/tmp/rabbit_real.png

Measured comparison vs current release/v0.18.0.post1 head:

release head: server_inference_time_s=113.73233714140952, artifact_ready_wall_s=114.78
revert branch: server_inference_time_s=113.81139127537608, artifact_ready_wall_s=114.856
text_encoder.forward total: 0.047962s -> 0.120787s
pipeline.forward: 111.590084s -> 111.948706s
DiT step tqdm wall time: 23.42s/it -> 23.50s/it
peak reserved GPU memory: 88.29GB -> 88.30GB

Conclusion:

this revert restores the earlier Wan2.2 prompt path, so text_encoder.forward regresses as expected
end-to-end latency for this short 4-step request stays nearly flat, but the revert is slightly slower rather than faster
both outputs still decode as 1280x720 / 81 frames / 16 fps

Additional local accuracy validation on worktree/issue2874-wan22-max-seq/tests/e2e/accuracy/wan22_i2v:

command:
- cd /mnt/data4/cwq/worktree/issue2874-wan22-max-seq
- PYTHONPATH=/mnt/data4/cwq/worktree/issue2874-wan22-max-seq /mnt/data4/cwq/.venv/bin/python -m pytest -q tests/e2e/accuracy/wan22_i2v -s
result: 16 passed in 3047.54s (0:50:47)
similarity metrics:
- SSIM=0.967964 (threshold >= 0.94)
- PSNR=37.894881 dB (threshold >= 28.0 dB)
online serving case:
- usp=2, hsdp-shard-size=2
- online_video_e2e_latency_s=833.868
artifact paths:
- tests/e2e/accuracy/wan22_i2v/result/rabbit-cf925a4c/online.mp4
- tests/e2e/accuracy/wan22_i2v/result/rabbit-cf925a4c/offline.mp4
- tests/e2e/accuracy/wan22_i2v/result/rabbit-cf925a4c/offline_metadata.json
notes:
- pytest completed successfully despite a non-fatal resource_tracker warning during server shutdown
- the run also emitted a vLLM-Omni / vLLM major-minor mismatch warning, but the full accuracy suite still passed

hsliuustc0106 · 2026-04-20T09:31:52Z

Ready for full review when draft status removed. Preliminary scan available on request.

chatgpt-codex-connector · 2026-04-20T11:08:24Z

Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits.
Credits must be used to enable repository wide code reviews.

david6666666 · 2026-04-20T11:08:51Z

gcanlin

LGTM

[Revert] drop Wan2.2 pipeline backport changes from PR2878

760df6c

Signed-off-by: david6666666 <530634352@qq.com>

david6666666 added the diffusion-x2v-test label to trigger buildkite x2video series of diffusion models test in nightly CI label Apr 20, 2026

david6666666 mentioned this pull request Apr 20, 2026

[RFC][0.20.0]: Qwen-Image、Qwen-Image-Layered、Qwen-Image-Edit-Plus、Wan2.2 Production-grade Feature Monitoring JiusiServe/vllm-omni#181

Closed

1 task

david6666666 removed the diffusion-x2v-test label to trigger buildkite x2video series of diffusion models test in nightly CI label Apr 20, 2026

david6666666 marked this pull request as ready for review April 20, 2026 11:08

david6666666 requested a review from hsliuustc0106 as a code owner April 20, 2026 11:08

david6666666 merged commit 89f733d into vllm-project:release/v0.18.0.post1 Apr 20, 2026
3 checks passed

gcanlin approved these changes Apr 20, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[codex][release/v0.18.0.post1] revert Wan2.2 pipeline changes from #2878#2937

[codex][release/v0.18.0.post1] revert Wan2.2 pipeline changes from #2878#2937
david6666666 merged 1 commit into
vllm-project:release/v0.18.0.post1from
david6666666:codex/revert-pr2878-wan22-release-v0180p1

david6666666 commented Apr 20, 2026

Uh oh!

david6666666 commented Apr 20, 2026 •

edited

Loading

Uh oh!

hsliuustc0106 commented Apr 20, 2026

Uh oh!

chatgpt-codex-connector Bot commented Apr 20, 2026

Uh oh!

david6666666 commented Apr 20, 2026

Uh oh!

Uh oh!

gcanlin left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

david6666666 commented Apr 20, 2026

Summary

What Changed

Validation

E2E Result

Uh oh!

david6666666 commented Apr 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hsliuustc0106 commented Apr 20, 2026

Uh oh!

chatgpt-codex-connector Bot commented Apr 20, 2026

Uh oh!

david6666666 commented Apr 20, 2026

Uh oh!

Uh oh!

gcanlin left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

david6666666 commented Apr 20, 2026 •

edited

Loading