
[Bugfix] Fix Qwen-Image SP and TeaCache incompatibility#2101

Merged
gcanlin merged 2 commits into vllm-project:main from wtomin:fix-teacache
Mar 23, 2026

Conversation


@wtomin wtomin commented Mar 23, 2026


Purpose

Resolves #2092.

Root Cause:

When SP is enabled, the _sp_plan registers a SequenceParallelSplitHook on image_rope_prepare to shard hidden_states after its forward pass. In the extract_qwen_context function, however, the TeaCache extractor invokes module.img_in and module.pos_embed directly, bypassing image_rope_prepare. The hook therefore never fires, and hidden_states is not sharded.

Solution

Modify the extract_qwen_context function in vllm_omni/diffusion/cache/teacache/extractors.py:
  • Invoke image_rope_prepare: replace the direct calls to img_in and pos_embed so that the SP split hook is triggered
  • Invoke modulate_index_prepare: replace the direct timestep processing so that modulate_index is correctly sharded when zero_cond_t=True
  • Pass modulate_index: pass modulate_index to the transformer blocks in run_transformer_blocks

This ensures TeaCache is properly compatible with Sequence Parallelism (both Ulysses-SP and Ring Attention).
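The mechanism behind the bug can be illustrated with a minimal, torch-free sketch (the class and hook names below are illustrative stand-ins, not the real vllm_omni or SequenceParallelSplitHook APIs): a forward hook registered on a wrapper submodule only fires when the wrapper itself is called, so calling its inner modules directly leaves hidden_states unsharded.

```python
# Minimal sketch of the root cause, assuming a torch-like hook protocol.
# None of these names are the real vllm_omni classes.

class Module:
    def __init__(self):
        self._hooks = []

    def register_forward_hook(self, fn):
        self._hooks.append(fn)

    def __call__(self, *args):
        out = self.forward(*args)
        for hook in self._hooks:
            out = hook(self, args, out)  # hook may replace the output
        return out

class ImgIn(Module):
    def forward(self, hidden_states):
        return hidden_states  # stand-in for the patch-embedding projection

class ImageRopePrepare(Module):
    """Wrapper that the _sp_plan attaches the SP split hook to."""
    def __init__(self):
        super().__init__()
        self.img_in = ImgIn()

    def forward(self, hidden_states):
        return self.img_in(hidden_states)

def sp_split_hook(module, args, output, rank=0, world_size=2):
    # Shard the sequence dimension, as the SP split hook would.
    n = len(output) // world_size
    return output[rank * n:(rank + 1) * n]

rope_prepare = ImageRopePrepare()
rope_prepare.register_forward_hook(sp_split_hook)

tokens = list(range(8))
# Buggy extractor path: inner module called directly, hook bypassed.
unsharded = rope_prepare.img_in(tokens)
# Fixed extractor path: wrapper called, hook fires and shards.
sharded = rope_prepare(tokens)
print(len(unsharded), len(sharded))  # 8 4
```

The fix is therefore just a matter of routing the extractor through the same wrapper submodules that the _sp_plan instruments.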

Test Plan

```shell
# ulysses-sp
python examples/offline_inference/text_to_image/text_to_image.py --model Qwen/Qwen-Image --ulysses-degree 2 --cache-backend tea_cache
# ring attention
python examples/offline_inference/text_to_image/text_to_image.py --model Qwen/Qwen-Image --ring-degree 2 --cache-backend tea_cache
```

Test Result

| SP | TeaCache | e2e latency | Image |
| --- | --- | --- | --- |
| usp=2 | ON | 2.37s | qwen_image_output |
| ring=2 | ON | 2.78s | qwen_image_output |


wtomin added 2 commits March 23, 2026 22:32
Signed-off-by: Didan Deng <33117903+wtomin@users.noreply.github.com>
Signed-off-by: Didan Deng <33117903+wtomin@users.noreply.github.com>
@wtomin wtomin marked this pull request as ready for review March 23, 2026 14:46
@wtomin wtomin requested a review from hsliuustc0106 as a code owner March 23, 2026 14:46
@wtomin wtomin requested review from ZJY0516 and gcanlin March 23, 2026 14:46

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 23bda666b7


```python
# For zero_cond_t=True: timestep is doubled, modulate_index is created and
# sharded by the SequenceParallelSplitHook on modulate_index_prepare so that
# its sequence dimension matches the already-sharded hidden_states.
timestep, modulate_index = module.modulate_index_prepare(timestep, img_shapes)
```

P1: Keep TeaCache extractor aligned with zero_cond_t batching

For Qwen checkpoints with zero_cond_t=True (the image-edit variants), ModulateIndexPrepare.forward() doubles timestep here (qwen_image_transformer.py:141-154), so temb becomes 2 * batch. The rest of extract_qwen_context() still consumes that embedding as if it were batch: block.img_norm1(hidden_states, img_mod1) is still called without modulate_index, and postprocess() never chunks temb back down before module.norm_out, unlike QwenImageTransformer2DModel.forward() (qwen_image_transformer.py:1062-1065). With TeaCache enabled, Qwen edit models will therefore fail on the first forward with a batch-dimension mismatch instead of running the transformer.
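The shape mismatch the reviewer describes can be sketched with a small, torch-free illustration (all helper names here are hypothetical, not the real transformer code): when zero_cond_t=True doubles timestep, temb ends up with 2 * batch rows, and norm_out fails unless temb is chunked back down first, as QwenImageTransformer2DModel.forward() does.

```python
# Shape-only sketch of the P1 finding; hypothetical helpers, no torch.

batch = 2

def modulate_index_prepare(timestep):
    # zero_cond_t=True: append a zeroed copy, doubling the batch dim.
    return timestep + [0.0] * len(timestep)

def time_embed(timestep):
    # One embedding row per timestep entry.
    return [[t, t] for t in timestep]

def norm_out(hidden_states, temb):
    # Expects exactly one temb row per sample in the batch.
    if len(temb) != len(hidden_states):
        raise ValueError(f"batch mismatch: {len(temb)} vs {len(hidden_states)}")
    return hidden_states

hidden_states = [[1.0], [2.0]]  # batch rows
temb = time_embed(modulate_index_prepare([0.5, 0.7]))

# Without chunking, norm_out fails with a batch-dimension mismatch:
try:
    norm_out(hidden_states, temb)
except ValueError as e:
    print(e)  # batch mismatch: 4 vs 2

# The alignment the reviewer points at: chunk temb back to batch first.
temb_cond, _temb_zero = temb[:batch], temb[batch:]
out = norm_out(hidden_states, temb_cond)
print(len(out))  # 2
```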


@wtomin wtomin mentioned this pull request Mar 23, 2026
@SamitHuang SamitHuang added the ready label to trigger buildkite CI label Mar 23, 2026

@SamitHuang SamitHuang left a comment


LGTM

@gcanlin gcanlin merged commit 5aef6b9 into vllm-project:main Mar 23, 2026
7 of 8 checks passed
