[cherry-pick][release/v0.18.0.post1] cherry-pick #2847 #2780 #2840 #2876 #2877#2878
Conversation
Signed-off-by: david6666666 <530634352@qq.com> (cherry picked from commit adda9a6)
Signed-off-by: david6666666 <530634352@qq.com> (cherry picked from commit 281e14a)
Signed-off-by: david6666666 <530634352@qq.com> (cherry picked from commit 66151f0)
Signed-off-by: david6666666 <530634352@qq.com> (cherry picked from commit 1e8fa70)
Signed-off-by: david6666666 <530634352@qq.com> (cherry picked from commit 0a6d618)
Signed-off-by: david6666666 <530634352@qq.com> (cherry picked from commit bd9bfaf)
Signed-off-by: david6666666 <530634352@qq.com> (cherry picked from commit 21851d6)
Signed-off-by: David Chen <530634352@qq.com> (cherry picked from commit 896b0b8)
Signed-off-by: david6666666 <530634352@qq.com> (cherry picked from commit 0e2f009)
Signed-off-by: david6666666 <530634352@qq.com> (cherry picked from commit eec0785)
Signed-off-by: david6666666 <530634352@qq.com> (cherry picked from commit 72af603)
Signed-off-by: david6666666 <530634352@qq.com> (cherry picked from commit f1900fe)
Signed-off-by: david6666666 <530634352@qq.com> (cherry picked from commit c95d20c)
Signed-off-by: david6666666 <530634352@qq.com> (cherry picked from commit 731c536)
Signed-off-by: david6666666 <530634352@qq.com> (cherry picked from commit 8c857c3)
Signed-off-by: david6666666 <530634352@qq.com> (cherry picked from commit 0c25a06)
Signed-off-by: david6666666 <530634352@qq.com> (cherry picked from commit 826c74a)
Signed-off-by: david6666666 <530634352@qq.com> (cherry picked from commit 297d06b)
Signed-off-by: david6666666 <530634352@qq.com> (cherry picked from commit 4ea2271)
Signed-off-by: david6666666 <530634352@qq.com> (cherry picked from commit f414061)
Signed-off-by: david6666666 <530634352@qq.com> (cherry picked from commit 05a7a5d)
Signed-off-by: david6666666 <530634352@qq.com> (cherry picked from commit f3e7ce9)
Signed-off-by: david6666666 <530634352@qq.com> (cherry picked from commit 3015646)
Signed-off-by: david6666666 <530634352@qq.com> (cherry picked from commit ecbb6d4)
Signed-off-by: david6666666 <530634352@qq.com>
|
Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits. |
|
Supplemental validation for the ordered backport onto Cherry-pick order used:
Local test environment note:
Static / unit validation:
E2E for
E2E for
E2E for
Backport note for
|
Signed-off-by: david6666666 <530634352@qq.com>
|
Update: I cherry-picked Additional commits on this branch:
Additional validation after adding
Re-validation of the original
Relevant server log evidence from this exact rerun:
So after adding |
|
BLOCKING ISSUE: This PR cherry-picks unmerged PRs (#2840, #2876) from main to the release branch. Release branches should only receive changes that have been proven on main. Cherry-picking open PRs bypasses the normal review process and can introduce unverified code. Please wait until #2840 and #2876 are reviewed and merged to main, then cherry-pick from there. |
|
Cherry-pick validation looks comprehensive. One concern: cherry-picking multiple PRs together can make conflict resolution fragile. When this lands, verify the backport doesn't create divergence issues with main branch behavior - especially the Wan2.2 max_sequence_length changes (#2847 + #2877 interaction) which were called out in the notes. Suggestion for future release branch work: Consider landing PRs individually when possible to reduce merge conflict surface area. |
|
UT is broken in v0.18.0.post1. Considering the quality of release, would be better to fix them. |
|
2847 2840 passed verification |
Signed-off-by: david6666666 <530634352@qq.com>
|
Follow-up for the Wan2.2 short-prompt performance regression observed on this backport branch. Root cause
Fix
Added regression test
Validation
E2E re-validation (same local environment style as the earlier PR comments)
Measured result for the fixed branch
Comparison against the earlier measurements collected on April 19, 2026
Profiler evidence from the fixed run
Conclusion
|
…-pr2847-2780-2840-2876 Signed-off-by: WeiQing Chen <40507679+david6666666@users.noreply.github.com>
2116e88
into
vllm-project:release/v0.18.0.post1
(#2937) Signed-off-by: david6666666 <530634352@qq.com>
Summary
release/v0.18.0.post1in that orderpre-commit run --all-filesValidation
python -m py_compile vllm_omni/engine/async_omni_engine.py tests/entrypoints/test_async_omni_diffusion_config.py tests/entrypoints/openai_api/test_image_server.pypython -m pytest -q tests/diffusion/models/qwen_image/test_qwen_image_max_sequence_length.py tests/diffusion/models/wan2_2/test_wan22_max_sequence_length.pypython -m pytest -q tests/diffusion/models/qwen_image/test_qwen_image_edit_plus.pypython -m pytest -q tests/entrypoints/openai_api/test_video_api_utils.pypython -m pytest -q tests/entrypoints/test_async_omni_diffusion_config.py tests/entrypoints/openai_api/test_image_server.pypre-commit run --all-filesNotes
max_sequence_length=512after cherry-picking [Bugfix] enforce max_sequence_length for Qwen-Image and Wan2.2 series before encoding #2847 into the same branch. I kept the 2876 serve command and runtime settings unchanged, confirmed that failure on the combined branch, then reran the same 2876 validation flow with a shorter prompt to verify RIFE still loads on CUDA.