
[Model]: add FLUX.2-dev model #1630

Closed
nuclearwu wants to merge 9 commits into vllm-project:main from nuclearwu:flux2-dev

Conversation

@nuclearwu
Contributor


Purpose

support https://huggingface.co/black-forest-labs/FLUX.2-dev

Test Plan

vLLM-Omni:

```bash
python examples/offline_inference/text_to_image/text_to_image.py \
  --model /workspace/cache/ymttest/johnjan/models/black-forest-labs/FLUX___2-dev/ \
  --prompt "a lovely bunny holding a sign that says 'vllm-omni'" \
  --seed 42 \
  --tensor-parallel-size 2 \
  --num-images-per-prompt 1 \
  --num-inference-steps 50 \
  --guidance-scale 4.0 \
  --height 1024 \
  --width 1024 \
  --output outputs/flux2-dev.png
```

Test Result

vLLM-Omni:
Reproduced on 4×A800 GPUs.

| Model      | diffusers      | TP=1 | TP=2         | TP=4         |
|------------|----------------|------|--------------|--------------|
| Flux.2-dev | flux2-dev      | OOM  | flux2-dev    | flux2-dev    |
| Time       | 104.9411 s/img | OOM  | 39.1087 s/img | 29.0770 s/img |

Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan. Please provide the test scripts & test commands. Please state the reasons if your codes don't require additional test scripts. For test file guidelines, please check the test style doc
  • The test results. Please paste the results comparison before and after, or the e2e results.
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model. Please run mkdocs serve to sync the documentation editions to ./docs.
  • (Optional) Release notes update. If your change is user-facing, please update the release notes draft.


Signed-off-by: wuzhongjian <wuzhongjian_yewu@cmss.chinamobile.com>
# Conflicts:
#	docs/user_guide/diffusion_acceleration.md
@nuclearwu nuclearwu requested a review from hsliuustc0106 as a code owner March 3, 2026 07:07
@nuclearwu nuclearwu closed this Mar 3, 2026

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: a53145a246


"""Taking only the first image for now.""",
)
first_prompt = req.prompts[0]
prompt = first_prompt if isinstance(first_prompt, str) else (first_prompt.get("prompt") or "")


P2: Allow embedding-only prompts by preserving `None` prompt

When a request supplies prompt_embeds without a text prompt, this line coerces missing text to "" instead of keeping it None. That makes check_inputs think both prompt and prompt_embeds were provided and raises a ValueError, so embedding-only inference for Flux2Pipeline is effectively broken.
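A minimal sketch of the suggested fix, assuming a `req` object shaped like the snippet above (these names are illustrative, not the actual vllm-omni code): return `None` instead of `""` when a dict-style prompt carries no text, so `check_inputs` can distinguish "no prompt given" from "empty prompt" and accept `prompt_embeds`-only requests.

```python
# Hypothetical helper: preserve None for missing text prompts so that
# prompt_embeds-only requests are not rejected by downstream validation.
def extract_prompt(req):
    first_prompt = req.prompts[0]
    if isinstance(first_prompt, str):
        return first_prompt
    # dict-style prompt: .get returns None when no "prompt" key exists,
    # instead of coercing the missing value to ""
    return first_prompt.get("prompt")


class Req:
    """Tiny stand-in for the request object in the snippet above."""
    def __init__(self, prompts):
        self.prompts = prompts
```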

Useful? React with 👍 / 👎.

Comment on lines +1054 to +1055

```python
# compute the previous noisy sample x_t -> x_t-1
latents = self.scheduler.step(noise_pred, t, latents, return_dict=False)[0]
```


P2: Invoke step-end callback during denoising iterations

The pipeline accepts callback_on_step_end and validates callback_on_step_end_tensor_inputs, but the denoising loop never invokes the callback. Any caller relying on per-step hooks (progress reporting, custom stopping, or latent inspection) will silently lose that behavior in Flux2Pipeline compared with other diffusion pipelines in this repo.
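A minimal sketch of the convention other diffusers-style pipelines follow, under assumed names (`transformer`, `scheduler`; this is not the actual Flux2Pipeline code): invoke `callback_on_step_end` after each scheduler step, passing only the tensors listed in `callback_on_step_end_tensor_inputs`, and let the callback optionally swap in new latents.

```python
# Hypothetical denoising loop showing where the missing hook would go.
def denoise(latents, timesteps, transformer, scheduler,
            callback_on_step_end=None,
            callback_on_step_end_tensor_inputs=("latents",)):
    for i, t in enumerate(timesteps):
        # predict noise and compute the previous noisy sample x_t -> x_t-1
        noise_pred = transformer(latents, t)
        latents = scheduler.step(noise_pred, t, latents, return_dict=False)[0]

        # the suggested addition: surface the validated tensors per step
        if callback_on_step_end is not None:
            callback_kwargs = {}
            if "latents" in callback_on_step_end_tensor_inputs:
                callback_kwargs["latents"] = latents
            step_output = callback_on_step_end(None, i, t, callback_kwargs)
            # diffusers convention: the callback may replace the latents
            latents = step_output.get("latents", latents)
    return latents
```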

Useful? React with 👍 / 👎.

@mergify

mergify Bot commented Mar 3, 2026

⚠️ The sha of the head commit of this PR conflicts with #1629. Mergify cannot evaluate rules on this PR. Once #1629 is merged or closed, Mergify will resume processing this PR. ⚠️

@nuclearwu nuclearwu deleted the flux2-dev branch March 11, 2026 08:30