[Feature] HunyuanImage3 allow guidance_scale<=1 in DiT stage by Fishermanykx · Pull Request #2762 · vllm-project/vllm-omni

Fishermanykx · 2026-04-14T03:06:03Z

Purpose

This PR is intended to adjust HunyuanImage3's default behavior that always generated the negative/unconditional branch, so generation can run in single-branch mode when guidance is not enabled.

What This PR Changes

Guidance behavior
- Allows guidance_scale <= 1.0 without forcing it to 1.0 + epsilon.
- This enables true non-CFG behavior for low-guidance requests.
CFG factor control
- Changes cfg_factor for gen_image from a fixed value to dynamic gating
- CFG duplication is now enabled only when guidance is actually greater than 1.
Tensor layout robustness
- Replaces view(...) with reshape(...) in the HunyuanImage3 attention output path to avoid runtime errors when tensors are non-contiguous.

Test Plan

Tested on 4x Ascend NPU with v0.18.0.post1 vllm omni

Online

vllm serve "/data/HunyuanImage-3.0/" --omni --port "8091" \
    --tensor_parallel_size 4  \
    --log-stats \
    --stage-configs-path "vllm_omni/platforms/npu/stage_configs/hunyuan_image3_moe_dit.yaml"

client

curl -X POST http://localhost:8091/v1/images/generations \
  -H "Content-Type: application/json" \
  -d '{
    "prompt": 
        "A cinematic medium shot captures a single Asian woman seated on a chair within a dimly lit room, creating an intimate and theatrical atmosphere. The composition is focused on the subject, rendered with rich colors and intricate textures that evoke a nostalgic and moody feeling.\n\nThe primary subject is a young Asian woman with a thoughtful and expressive countenance, her gaze directed slightly away from the camera. She is seated in a relaxed yet elegant posture on an ornate, vintage armchair. The chair is upholstered in a deep red velvet, its fabric showing detailed, intricate textures and slight signs of wear. She wears a simple, elegant dress in a dark teal hue, the material catching the light in a way that reveals its fine-woven texture. Her skin has a soft, matte quality, and the light delicately models the contours of her face and arms.\n\nThe surrounding room is characterized by its vintage decor, which contributes to the historic and evocative mood. In the immediate background, partially blurred due to a shallow depth of field consistent with a f/2.8 aperture, the wall is covered with wallpaper featuring a subtle, damask pattern. The overall color palette is a carefully balanced interplay of deep teal and rich red hues, creating a visually compelling and cohesive environment. The entire scene is detailed, from the fibers of the upholstery to the subtle patterns on the wall.\n\nThe lighting is highly dramatic and artistic, defined by high contrast and pronounced shadow play. A single key light source, positioned off-camera, projects gobo lighting patterns onto the scene, casting intricate shapes of light and shadow across the woman and the back wall. These dramatic shadows create a strong scense of depth and a theatrical quality. While some shadows are deep and defined, others remain soft, gently wrapping around the subject and preventing the loss of detail in darker areas. The soft focus on the background enhances the intimate feeling, drawing all attention to the expressive subject. The overall image presents a cinematic, photorealistic photography style.",
    "num_inference_steps": 50,
    "guidance_scale": "1.0",
    "n": 1,
    "size": "1024x1024",
    "seed": 42
  }' | jq -r '.data[0].b64_json' | base64 -d > output.png

test plan without vllm omni

torchrun --master_port=10086 --nproc_per_node 4 run_image_gen.py --reproduce --model-id $model  --verbose 0 --image-size 1024x1024 --diff-infer-steps 50 --prompt $prompt 2>&1 | tee "./logs/$(date +%Y%m%d_%H%M%S).log"

Test Result

guidance scale	E2E
5.0	27.256s
1.0	15.895s

guidance scale = 1.0 with vllm omni

guidance scale = 1.0 without vllm omni

chatgpt-codex-connector · 2026-04-14T06:59:48Z

Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits.
Credits must be used to enable repository wide code reviews.

Fishermanykx · 2026-04-14T07:03:14Z

@gcanlin @Bounty-hunter PTAL

gcanlin · 2026-04-15T01:48:49Z

Why do we think that hunyuan-image doesn't support guidance_scale <= 1 before?

Fishermanykx · 2026-04-15T02:04:55Z

Why do we think that hunyuan-image doesn't support guidance_scale <= 1 before?

In line 1013 of vllm_omni/diffusion/models/hunyuan_image_3/pipeline_hunyuan_image_3.py, if guidance_scale <= 1, it will be set to 1.0 + np.finfo(float).eps

gcanlin · 2026-04-15T03:14:25Z

@@ -544,7 +543,7 @@ def prepare_model_inputs(
            generator = [torch.Generator(self.device).manual_seed(seed) for seed in seeds]

        # 3. apply chat template
-        cfg_factor = {"gen_text": 1, "gen_image": 2}
+        cfg_factor = {"gen_text": 1, "gen_image": 1 + int(guidance_scale > 1.0)}


Is this line equivalent to the official line below?
https://github.com/Tencent-Hunyuan/HunyuanImage-3.0/blob/d280425cf453a153e5846c725af58de39c10b09f/hunyuan_image_3/hunyuan_image_3_pipeline.py#L776

I think it is

gcanlin

LGTM

Signed-off-by: KexiongYu <yukexiong1@huawei.com>

…oject#2762) Signed-off-by: KexiongYu <yukexiong1@huawei.com>

Fishermanykx changed the title ~~[WIP] [Fix] HunyuanImage3 guidance_scale<=1 and cfg-factor gating~~ [WIP] HunyuanImage3 allow guidance_scale<=1 Apr 14, 2026

Fishermanykx changed the title ~~[WIP] HunyuanImage3 allow guidance_scale<=1~~ [WIP] [Feature] HunyuanImage3 allow guidance_scale<=1 Apr 14, 2026

Fishermanykx changed the title ~~[WIP] [Feature] HunyuanImage3 allow guidance_scale<=1~~ [WIP] [Feature] HunyuanImage3 allow guidance_scale<=1 in DiT stage Apr 14, 2026

Fishermanykx force-pushed the yukexiong/hunyuan_guidance_scale_le1 branch 2 times, most recently from b0b20e2 to 9108826 Compare April 14, 2026 06:53

Fishermanykx marked this pull request as ready for review April 14, 2026 06:59

Fishermanykx requested a review from hsliuustc0106 as a code owner April 14, 2026 06:59

Fishermanykx changed the title ~~[WIP] [Feature] HunyuanImage3 allow guidance_scale<=1 in DiT stage~~ [Feature] HunyuanImage3 allow guidance_scale<=1 in DiT stage Apr 14, 2026

Fishermanykx force-pushed the yukexiong/hunyuan_guidance_scale_le1 branch 3 times, most recently from a8ab210 to 178f23c Compare April 15, 2026 01:37

gcanlin reviewed Apr 15, 2026

View reviewed changes

gcanlin approved these changes Apr 15, 2026

View reviewed changes

gcanlin added the ready label to trigger buildkite CI label Apr 15, 2026

Fishermanykx force-pushed the yukexiong/hunyuan_guidance_scale_le1 branch from 178f23c to 90f3077 Compare April 15, 2026 03:35

Fishermanykx added 3 commits April 15, 2026 14:21

[Fix] Allow guidance_scale <= 1.0 in HunyuanImage3

de52b0c

Signed-off-by: KexiongYu <yukexiong1@huawei.com>

[Fix] HunyuanImage3 guidance-scale gating and non-contiguous attn output

0adabaf

Signed-off-by: KexiongYu <yukexiong1@huawei.com>

clean code

d2091f9

Signed-off-by: KexiongYu <yukexiong1@huawei.com>

Fishermanykx force-pushed the yukexiong/hunyuan_guidance_scale_le1 branch from 90f3077 to d2091f9 Compare April 15, 2026 06:22

gcanlin merged commit 50ae1de into vllm-project:main Apr 15, 2026
8 checks passed

y123456y78 pushed a commit to y123456y78/vllm-omni that referenced this pull request Apr 15, 2026

[Feature] HunyuanImage3 allow guidance_scale<=1 in DiT stage (vllm-pr…

1a85360

…oject#2762) Signed-off-by: KexiongYu <yukexiong1@huawei.com>

lvliang-intel pushed a commit to lvliang-intel/vllm-omni that referenced this pull request Apr 20, 2026

[Feature] HunyuanImage3 allow guidance_scale<=1 in DiT stage (vllm-pr…

ab5e416

…oject#2762) Signed-off-by: KexiongYu <yukexiong1@huawei.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature] HunyuanImage3 allow guidance_scale<=1 in DiT stage#2762

[Feature] HunyuanImage3 allow guidance_scale<=1 in DiT stage#2762
gcanlin merged 3 commits intovllm-project:mainfrom
Fishermanykx:yukexiong/hunyuan_guidance_scale_le1

Fishermanykx commented Apr 14, 2026 •

edited

Loading

Uh oh!

chatgpt-codex-connector Bot commented Apr 14, 2026

Uh oh!

Fishermanykx commented Apr 14, 2026

Uh oh!

gcanlin commented Apr 15, 2026

Uh oh!

Fishermanykx commented Apr 15, 2026 •

edited

Loading

Uh oh!

gcanlin Apr 15, 2026

Uh oh!

Fishermanykx Apr 15, 2026

Uh oh!

gcanlin left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Fishermanykx commented Apr 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

What This PR Changes

Test Plan

Test Result

Uh oh!

chatgpt-codex-connector Bot commented Apr 14, 2026

Uh oh!

Fishermanykx commented Apr 14, 2026

Uh oh!

gcanlin commented Apr 15, 2026

Uh oh!

Fishermanykx commented Apr 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gcanlin Apr 15, 2026

Choose a reason for hiding this comment

Uh oh!

Fishermanykx Apr 15, 2026

Choose a reason for hiding this comment

Uh oh!

gcanlin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fishermanykx commented Apr 14, 2026 •

edited

Loading

Fishermanykx commented Apr 15, 2026 •

edited

Loading