
Improve control net block index for sd3 #9758

Merged · 16 commits · Nov 20, 2024

Conversation

@linjiapro (Contributor) commented Oct 23, 2024

What does this PR do?

The layer configuration for the Control Net in Stable Diffusion 3 models must adhere to the rule that the total number of layers in the SD3 model should be a multiple of the Control Net's layer count.

For SD3.5, which has 38 layers, that leaves only three possible layer counts for the Control Net: 2, 19, or 38. This makes the setup inflexible.
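As a rough illustration of the constraint (a minimal sketch; the layer count is SD3.5 Large's, the enumeration is just for illustration):

```python
# Under the old rule, the controlnet layer count must divide the
# transformer layer count evenly.
num_transformer_layers = 38  # SD3.5 Large

valid_counts = [n for n in range(1, num_transformer_layers + 1)
                if num_transformer_layers % n == 0]
print(valid_counts)  # [1, 2, 19, 38]
```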

Also, qk_norm, context_pre_only_last_layer, and dual_attention_layers are added to match the transformer architecture.

Who can review?

@sayakpaul @yiyixuxu @DN6

@linjiapro (Contributor Author)

cc @sayakpaul @yiyixuxu @DN6

@linjiapro (Contributor Author)

This is a very simple PR, but for some reason the tests all failed with weird errors such as:

Unable to find self-hosted runner group: 'aws-general-8-plus'.

@sayakpaul (Member)

Thanks for your contributions! Could you maybe also add a test for this?

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@linjiapro (Contributor Author)

@sayakpaul

There is an existing test for the SD3 ControlNet pipeline, and I leveraged that. The number of layers for the controlnet changed from 1 to 3, so the transformer's layer count (4) is no longer a multiple of the controlnet's layer count. This exercises the code changes in this PR.
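In other words (an illustrative check, not the actual test code):

```python
# The dummy transformer in the pipeline test has 4 layers; the dummy
# controlnet now has 3. Since 4 % 3 != 0, the old floor-division logic
# no longer applies cleanly, and the new interval computation is exercised.
num_transformer_layers = 4
num_controlnet_layers = 3
assert num_transformer_layers % num_controlnet_layers != 0
```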

@sayakpaul (Member) left a comment

Thanks, I left some comments.

.gitignore Outdated
```diff
@@ -102,6 +102,7 @@ venv/
 ENV/
 env.bak/
 venv.bak/
+myenv/
```
Member: Should be removed.

Contributor Author: Done

```diff
@@ -77,7 +77,7 @@ def get_dummy_components(self):
     sample_size=32,
     patch_size=1,
     in_channels=8,
-    num_layers=1,
+    num_layers=3,
```
Member: We shouldn't change this value here, I think. Instead, we could make this method accept an argument like num_controlnet_layers and then leverage it as needed. WDYT?
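Something along these lines, perhaps (a sketch; `num_controlnet_layers` is the argument name suggested above, the other kwargs are copied from the diff, and the class name is assumed to be the one the test already uses):

```python
def get_dummy_components(self, num_controlnet_layers: int = 3):
    controlnet = SD3ControlNetModel(
        sample_size=32,
        patch_size=1,
        in_channels=8,
        num_layers=num_controlnet_layers,
        # ... remaining kwargs unchanged from the existing setup ...
    )
    # ... rest of the method unchanged ...
```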

Contributor Author: Done

@linjiapro (Contributor Author) commented Nov 15, 2024

@sayakpaul @yiyixuxu, can we take a look at this? Thanks

@bghira (Contributor) commented Nov 16, 2024

maybe @DN6 is more active.

```diff
@@ -344,7 +345,8 @@ def custom_forward(*inputs):

 # controlnet residual
 if block_controlnet_hidden_states is not None and block.context_pre_only is False:
-    interval_control = len(self.transformer_blocks) // len(block_controlnet_hidden_states)
+    interval_control = len(self.transformer_blocks) / len(block_controlnet_hidden_states)
```
@yiyixuxu (Collaborator)

why are we making this change? It is not the same, so a breaking change, no?

Contributor Author (@linjiapro)

@yiyixuxu Good question.

The revised code adapts the strategy used by ControlNet for Flux, introducing a significant improvement in flexibility. Here's why this change matters:

In the old code, the number of transformer layers must be divisible by the number of ControlNet layers. For example, with SD3.5 Large, which has 38 transformer layers, there were only two valid options for the number of ControlNet layers: 2 and 19. Setting the number of ControlNet layers to anything else, such as 5, would cause the old code to crash.

However, the Flux ControlNet approach removes this restriction, allowing greater flexibility in choosing the number of layers. The revised logic essentially mirrors the Flux implementation, enabling more versatile configurations.

Importantly, the new code maintains compatibility with existing setups. If the number of transformer layers is divisible by the number of ControlNet layers, the interval_control remains unchanged, ensuring all previous configurations continue to function seamlessly.
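For reference, a sketch of the Flux-style mapping this comment refers to (the helper name is hypothetical; the ceil-plus-floor-division pattern follows the Flux controlnet logic):

```python
import math

def controlnet_residual_index(block_index: int,
                              num_transformer_layers: int,
                              num_controlnet_layers: int) -> int:
    # Float division plus ceil: any controlnet layer count is allowed,
    # and block_index // interval_control never runs past the end of
    # the controlnet residual list.
    interval_control = math.ceil(num_transformer_layers / num_controlnet_layers)
    return block_index // interval_control

# SD3.5 Large (38 layers) with a 5-layer controlnet, which the old
# floor-division logic could not handle:
indices = [controlnet_residual_index(i, 38, 5) for i in range(38)]
print(indices)  # eight 0s, eight 1s, eight 2s, eight 3s, six 4s
```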

@yiyixuxu (Collaborator)

thanks! I think it's indeed better. I'm just wondering if it would cause issues for a controlnet that was trained with the current logic.
cc @haofanwang here

@linjiapro (Contributor Author) commented Nov 20, 2024

I don't think it will cause any issues for controlnets trained with the old code before this PR.

The reason is that for a controlnet to have been trained with the old code, the number of transformer layers has to be divisible by the number of controlnet layers, and the new logic after this PR does not change the behavior in that scenario.
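A quick sanity check of that claim (a sketch; when the counts divide evenly, the old and new computations agree):

```python
import math

num_transformer_layers, num_controlnet_layers = 38, 19  # divisible case

old_interval = num_transformer_layers // num_controlnet_layers             # 2
new_interval = math.ceil(num_transformer_layers / num_controlnet_layers)   # 2
assert old_interval == new_interval  # identical indexing, so old checkpoints behave the same
```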

@yiyixuxu (Collaborator)

Can you run `make style` and `make fix-copies`?

@yiyixuxu yiyixuxu merged commit 1235862 into huggingface:main Nov 20, 2024
13 of 15 checks passed