enable Qwen image Layered on Gaudi. by nc-BobLee · Pull Request #383 · HabanaAI/optimum-habana-fork

nc-BobLee · 2025-12-30T05:58:43Z

No description provided.

Wei-Lin-Intel · 2026-01-05T07:10:11Z

+
+
+    device = 'hpu'
+    model_path = '/mnt/ceph1/libo/hf_models/Qwen-Image-Layered/'


Change the path to something generic like /data/Qwen-Image-Layered, or use the default Hugging Face model ID.
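
One way to address this is to take the model location from the command line instead of hardcoding a personal mount point. The following is a minimal sketch, not the code that was merged: the argument name `--model_name_or_path` and the default `Qwen/Qwen-Image-Layered` are assumptions made for illustration, and the exact Hub model ID should be confirmed before use.

```python
import argparse

# Assumption: this string stands in for the default Hugging Face model ID;
# verify the exact ID on the Hub before relying on it.
DEFAULT_MODEL = "Qwen/Qwen-Image-Layered"


def parse_args(argv=None):
    """Parse example-script arguments (hypothetical names, for illustration)."""
    parser = argparse.ArgumentParser(description="Qwen-Image-Layered example (sketch)")
    parser.add_argument(
        "--model_name_or_path",
        default=DEFAULT_MODEL,
        help="Local directory (e.g. /data/Qwen-Image-Layered) or a Hugging Face model ID.",
    )
    parser.add_argument("--device", default="hpu", help="Target device string.")
    return parser.parse_args(argv)
```

With this shape, a local checkout is selected by passing `--model_name_or_path /data/Qwen-Image-Layered`, and nothing user-specific is committed to the example.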

Wei-Lin-Intel · 2026-01-05T07:18:53Z

+
+from transformers import Qwen2_5_VLForConditionalGeneration, Qwen2Tokenizer, Qwen2VLProcessor
+
+from optimum.habana.transformers.models import GaudiQwen2_5_VLForConditionalGeneration


If it is not used, delete it.

Wei-Lin-Intel · 2026-01-05T07:19:24Z

+from transformers import Qwen2_5_VLForConditionalGeneration, Qwen2Tokenizer, Qwen2VLProcessor
+
+from optimum.habana.transformers.models import GaudiQwen2_5_VLForConditionalGeneration
+from optimum.habana.diffusers.models.qwenimage_transformer import QwenImageTransformer2DModelGaudi,QwenImageTransformerBlockForwardGaudi


Add a space between QwenImageTransformer2DModelGaudi, and QwenImageTransformerBlockForwardGaudi

Wei-Lin-Intel · 2026-01-05T07:19:58Z

+        if use_hpu_graphs:
+            logger.warning(
+                "WARNING:!!!GaudiQwenImageLayeredPipeline HPU graph mode may have OOM problem when image size changes. Please set use_hpu_graphs=False!!!"
+            )


Set use_hpu_graphs=False here, in addition to logging the warning, to avoid the OOM issue.
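
A sketch of what this suggestion could look like: rather than only warning, the value is overridden so the pipeline never enters HPU graph mode. The helper name `resolve_use_hpu_graphs` is hypothetical and does not come from the PR.

```python
import logging

logger = logging.getLogger(__name__)


def resolve_use_hpu_graphs(requested: bool) -> bool:
    """Hypothetical helper: warn if HPU graphs were requested, then disable them."""
    if requested:
        logger.warning(
            "HPU graph mode may run out of memory when the image size changes; "
            "forcing use_hpu_graphs=False."
        )
    # Graph mode is always disabled, regardless of what the caller asked for.
    return False
```

In the pipeline constructor this would be used as `use_hpu_graphs = resolve_use_hpu_graphs(use_hpu_graphs)` before the value is passed on.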

Wei-Lin-Intel · 2026-01-05T07:20:55Z

+        self.transformer.hidden_states_buckets_step = hidden_states_buckets_step
+        self.transformer.encoder_hidden_states_buckets_step = encoder_hidden_states_buckets_step
+
+        #self.to(self._device)


Wei-Lin-Intel · 2026-01-05T07:48:12Z

+from diffusers.utils.torch_utils import randn_tensor
+from diffusers.pipelines.qwenimage import QwenImagePipelineOutput
+from diffusers.models.autoencoders.autoencoder_kl_qwenimage import QwenImageAttentionBlock
+from diffusers.pipelines.qwenimage.pipeline_qwenimage_layered import QwenImageLayeredPipeline,calculate_dimensions,retrieve_timesteps


Add a space after each comma in the import list.

yingjie-han · 2026-01-05T08:50:48Z

+            theta=10000, axes_dim=list(config["axes_dims_rope"]), scale_rope=True
+        )
+
+        self.vae_decode_latents_buckets = [128,160,188]


Please test different image sizes and check whether these default bucket values are suitable for this model.

yingjie-han · 2026-01-05T08:51:26Z

+                f"vae_decode_latents_buckets is {self.vae_decode_latents_buckets}."
+            )
+
+        self.vae_encode_buckets = [1024,1280,1504]


Please test different image sizes and check whether these default bucket values are suitable for this model.
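
A quick way to act on both bucket comments is to check which candidate sizes the defaults cover. The sketch below assumes a round-up policy (each size is mapped to the smallest bucket that is at least as large); the PR excerpt does not show how the pipeline actually consumes the buckets, so `pick_bucket` and `uncovered_sizes` are hypothetical helpers, and the sizes passed in are examples only.

```python
from bisect import bisect_left
from typing import List, Optional, Sequence

# Default values quoted from the diff hunks above.
VAE_DECODE_LATENTS_BUCKETS = [128, 160, 188]
VAE_ENCODE_BUCKETS = [1024, 1280, 1504]


def pick_bucket(size: int, buckets: Sequence[int]) -> Optional[int]:
    """Return the smallest bucket >= size, or None if size exceeds every bucket."""
    ordered = sorted(buckets)
    idx = bisect_left(ordered, size)
    return ordered[idx] if idx < len(ordered) else None


def uncovered_sizes(sizes: Sequence[int], buckets: Sequence[int]) -> List[int]:
    """List the sizes that no bucket can hold under the round-up assumption."""
    return [s for s in sizes if pick_bucket(s, buckets) is None]
```

Running `uncovered_sizes` over the image sizes the example is expected to support shows at a glance whether a larger bucket is needed.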

Wei-Lin-Intel · 2026-01-05T13:40:14Z

+                layer.forwward = types.MethodType(QwenImageAttentionBlockForwardGaudi, layer)
+
+        config = self.transformer.config
+        self.transformer.pos_embed = GaudiQwenEmbedRope(


Be careful with the new PR: not everything from the old PR can be applied as-is. According to the Qwen-Image-Layered Diffusers PR, this model uses QwenEmbedLayer3DRope, not QwenEmbedRope. Please also check the other places to make sure everything has been migrated correctly.
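
When auditing the migrated patches, note that overriding a method with `types.MethodType` is plain attribute assignment, so a misspelled target name (the quoted hunk assigns to `layer.forwward`) is accepted silently and the original `forward` keeps running. The following is a minimal sketch of a guard against that; `patch_method`, `Block`, and `forward_gaudi` are hypothetical stand-ins, not names from the PR.

```python
import types


def patch_method(obj, name, func):
    """Bind func to obj as method `name`, refusing names the class does not define."""
    if not callable(getattr(type(obj), name, None)):
        raise AttributeError(f"{type(obj).__name__} has no method {name!r} to override")
    setattr(obj, name, types.MethodType(func, obj))


class Block:
    """Stand-in for a model layer, for illustration only."""

    def forward(self, x):
        return x


def forward_gaudi(self, x):
    """Stand-in for a device-specific forward implementation."""
    return x * 2
```

Calling `patch_method(layer, "forward", forward_gaudi)` replaces the method, while a typo in the name raises `AttributeError` instead of passing unnoticed.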

nc-BobLee added 6 commits December 30, 2025 13:53

enable QwenimageLayered on gaudi.

3d5fb56

refine sdpa attention forward.

24ba8fd

use adapt transformers to gaudi.

320ae1f

add param additional_t_cond

1cf891f

refine text_encoder generate

63220f9

refine example code.

c7e91f3

Wei-Lin-Intel reviewed Jan 5, 2026

refine code style

5aac8ca

yingjie-han reviewed Jan 5, 2026

Wei-Lin-Intel reviewed Jan 5, 2026

nc-BobLee and others added 5 commits January 6, 2026 16:50

add 3D rope.

3d4b644

set Qwen2.5 VL cache static

b9201f2

Add doc for Qwen-Image-Layered HabanaAI#1

984ea50

Add doc for Qwen-Image-Layered HabanaAI#2

941efa4

Add doc for Qwen-Image-Layered HabanaAI#3

e7b826d

Wei-Lin-Intel approved these changes Jan 7, 2026

Wei-Lin-Intel merged commit 23dd420 into HabanaAI:aice/v1.22.0 Jan 7, 2026

Wei-Lin-Intel merged 12 commits into HabanaAI:aice/v1.22.0 from nc-BobLee:qwenimage_layered_dev

3 participants


