Skip to content

[Feature]: support Flux.2-dev tea_cache#1871

Merged
hsliuustc0106 merged 13 commits into
vllm-project:mainfrom
nuclearwu:flux2
Apr 13, 2026
Merged

[Feature]: support Flux.2-dev tea_cache#1871
hsliuustc0106 merged 13 commits into
vllm-project:mainfrom
nuclearwu:flux2

Conversation

@nuclearwu
Copy link
Copy Markdown
Contributor

@nuclearwu nuclearwu commented Mar 13, 2026

PLEASE FILL IN THE PR DESCRIPTION HERE ENSURING ALL CHECKLIST ITEMS (AT THE BOTTOM) HAVE BEEN CONSIDERED.

Purpose

Ref #1806, support Flux.2-dev tea_cache

Test Plan

python examples/offline_inference/text_to_image/text_to_image.py \
  --model /workspace/cache/ymttest/johnjan/models/black-forest-labs/FLUX___2-dev/ \
  --prompt "a lovely bunny holding a sign that says 'vllm-omni'" \
  --seed 42 \
  --tensor-parallel-size 4 \
  --num-images-per-prompt 1 \
  --num-inference-steps 50 \
  --guidance-scale 4.0 \
  --height 1024 \
  --width 1024 \
  --cache-backend tea_cache \
  --output outputs/flux2-dev-teacache.png

Test Result

vLLM-Omni:
Reproduced with 4xA800.

Config Flux.2-dev Baseline With Tea-Cache Speedup
TP=1 & CPU offload flux2-dev-teacache 90.5477s 49.3883s 1.83x
TP=2 & CPU offload flux2-dev-teacache 62.3353s 35.4450s 1.76x
TP=4 flux2-dev-teacache 28.8728s 7.9040s 3.65x

Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan. Please provide the test scripts & test commands. Please state the reasons if your codes don't require additional test scripts. For test file guidelines, please check the test style doc
  • The test results. Please paste the results comparison before and after, or the e2e results.
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model. Please run mkdocs serve to sync the documentation editions to ./docs.
  • (Optional) Release notes update. If your change is user-facing, please update the release notes draft.

BEFORE SUBMITTING, PLEASE READ https://github.com/vllm-project/vllm-omni/blob/main/CONTRIBUTING.md (anything written below this line will be removed by GitHub Actions)

@nuclearwu
Copy link
Copy Markdown
Contributor Author

nuclearwu commented Mar 13, 2026

@Gaohan123
Copy link
Copy Markdown
Collaborator

@wtomin @SamitHuang @ZJY0516 PTAL

@Gaohan123 Gaohan123 added this to the v0.18.0 milestone Mar 21, 2026
Copy link
Copy Markdown
Collaborator

@Gaohan123 Gaohan123 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please supplement UT for this feature. Thanks

@Gaohan123 Gaohan123 removed this from the v0.18.0 milestone Mar 24, 2026
Signed-off-by: wuzhongjian <wuzhongjian_yewu@cmss.chinamobile.com>
Signed-off-by: wuzhongjian <wuzhongjian_yewu@cmss.chinamobile.com>
# Conflicts:
#	vllm_omni/diffusion/cache/teacache/config.py
@nuclearwu
Copy link
Copy Markdown
Contributor Author

Please supplement UT for this feature. Thanks

@Gaohan123 Done

@nuclearwu nuclearwu requested a review from Gaohan123 March 30, 2026 02:03
# Conflicts:
#	docs/user_guide/diffusion_acceleration.md
@nuclearwu
Copy link
Copy Markdown
Contributor Author

@Gaohan123 @hsliuustc0106 please review again.

@nuclearwu
Copy link
Copy Markdown
Contributor Author

cc @Gaohan123 @hsliuustc0106

@hsliuustc0106 hsliuustc0106 added the ready label to trigger buildkite CI label Apr 3, 2026
@nuclearwu
Copy link
Copy Markdown
Contributor Author

nuclearwu commented Apr 3, 2026

@hsliuustc0106 The CI error (Docker container timeout with exit code 124) has no direct logical relationship with the PR code changes

@nuclearwu
Copy link
Copy Markdown
Contributor Author

cc @hsliuustc0106 @Gaohan123

@hsliuustc0106
Copy link
Copy Markdown
Collaborator

resolve conflicts please

# Conflicts:
#	tests/diffusion/cache/test_teacache_extractors.py
Signed-off-by: wuzhongjian <wuzhongjian_yewu@cmss.chinamobile.com>
@nuclearwu
Copy link
Copy Markdown
Contributor Author

resolve conflicts please

@hsliuustc0106 Done

@nuclearwu
Copy link
Copy Markdown
Contributor Author

cc @hsliuustc0106

# Conflicts:
#	docs/user_guide/diffusion_features.md
Signed-off-by: wuzhongjian <wuzhongjian_yewu@cmss.chinamobile.com>
@wtomin
Copy link
Copy Markdown
Collaborator

wtomin commented Apr 12, 2026

There is a known bug when both teacache and cpu-offload are enabled. #1868. I think it will be fixed soon by #2689. Please test with your implementation again after this bugfix.

@nuclearwu
Copy link
Copy Markdown
Contributor Author

nuclearwu commented Apr 13, 2026

There is a known bug when both teacache and cpu-offload are enabled. #1868. I think it will be fixed soon by #2689. Please test with your implementation again after this bugfix.

@wtomin @hsliuustc0106 OK, Done. The verification results are shown in the above table and the bugfix is effective.

Copy link
Copy Markdown
Collaborator

@wtomin wtomin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM.

@hsliuustc0106 hsliuustc0106 merged commit cd2761e into vllm-project:main Apr 13, 2026
8 checks passed
Celeste-jq pushed a commit to IsleOfDawnlight/vllm-omni-voxcpm that referenced this pull request Apr 14, 2026
Co-authored-by: wuzhongjian <wuzhongjian@cmss.chinamobile.com>
lengrongfu pushed a commit to lengrongfu/vllm-omni that referenced this pull request May 1, 2026
Co-authored-by: wuzhongjian <wuzhongjian@cmss.chinamobile.com>
clodaghwalsh17 pushed a commit to clodaghwalsh17/nm-vllm-omni-ent that referenced this pull request May 12, 2026
Co-authored-by: wuzhongjian <wuzhongjian@cmss.chinamobile.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ready label to trigger buildkite CI

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants