You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The transformer model introduced in SD3 (`transformer_sd3.py`) expects the parameters shown below. Two of these parameters — `dual_attention_layers` and `qk_norm` — remain undefined in the `SD3ControlNetModel` class (`controlnet_sd3.py`).
# Excerpt quoted from diffusers `transformer_sd3.py`: the constructor of the
# SD3 transformer model. NOTE(review): this is a truncated excerpt — the real
# `__init__` body continues past the last line shown here.
@register_to_config
def __init__(
    self,
    sample_size: int = 128,
    patch_size: int = 2,
    in_channels: int = 16,
    num_layers: int = 18,
    attention_head_dim: int = 64,
    num_attention_heads: int = 18,
    joint_attention_dim: int = 4096,
    caption_projection_dim: int = 1152,
    pooled_projection_dim: int = 2048,
    out_channels: int = 16,
    pos_embed_max_size: int = 96,
    # The two parameters the issue is about: present here but (per the report)
    # missing from SD3ControlNetModel's signature.
    dual_attention_layers: Tuple[
        int, ...
    ] = (), # () for sd3.0; (0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12) for sd3.5
    qk_norm: Optional[str] = None,
):
    super().__init__()
    # Fall back to the input channel count when no explicit out_channels is given.
    default_out_channels = in_channels
    self.out_channels = out_channels if out_channels is not None else default_out_channels
    # Hidden width = heads * per-head dim; read back through self.config,
    # which @register_to_config populates from the arguments above.
    self.inner_dim = self.config.num_attention_heads * self.config.attention_head_dim
Reproduction
# Excerpt quoted from diffusers `controlnet_sd3.py`, shown here WITH the two
# parameters (`dual_attention_layers`, `qk_norm`) whose absence causes the
# TypeError in the traceback below — i.e. this is the proposed fixed signature.
# NOTE(review): truncated excerpt — the real class body continues
# (e.g. `from_transformer`, forward pass) past the last line shown here.
class SD3ControlNetModel(ModelMixin, ConfigMixin, PeftAdapterMixin, FromOriginalModelMixin):
    _supports_gradient_checkpointing = True

    @register_to_config
    def __init__(
        self,
        sample_size: int = 128,
        patch_size: int = 2,
        in_channels: int = 16,
        num_layers: int = 18,
        attention_head_dim: int = 64,
        num_attention_heads: int = 18,
        joint_attention_dim: int = 4096,
        caption_projection_dim: int = 1152,
        pooled_projection_dim: int = 2048,
        out_channels: int = 16,
        pos_embed_max_size: int = 96,
        extra_conditioning_channels: int = 0,
        # Annotated to mirror the transformer signature in transformer_sd3.py;
        # () selects sd3.0 behavior, a tuple of block indices selects sd3.5.
        dual_attention_layers: Tuple[int, ...] = (),
        qk_norm: Optional[str] = None,
    ):
        super().__init__()
        # Fall back to the input channel count when no explicit out_channels is given.
        default_out_channels = in_channels
        self.out_channels = out_channels if out_channels is not None else default_out_channels
        # Hidden width = heads * per-head dim. NOTE(review): reads the raw
        # arguments here, whereas the transformer excerpt reads self.config —
        # equivalent values, but stylistically inconsistent.
        self.inner_dim = num_attention_heads * attention_head_dim
Logs
Traceback (most recent call last):
File "/data/user/user/project/controlnet_huggingface/diffusers/examples/controlnet/train_controlnet_sd3.py", line 1412, in<module>
main(args)
File "/data/user/user/project/controlnet_huggingface/diffusers/examples/controlnet/train_controlnet_sd3.py", line 989, in main
controlnet = SD3ControlNetModel.from_transformer(transformer)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/user/data/conda/envs/diffusers/lib/python3.11/site-packages/diffusers/models/controlnets/controlnet_sd3.py", line 251, in from_transformer
controlnet = cls(**config)
^^^^^^^^^^^^^
File "/home/user/data/conda/envs/diffusers/lib/python3.11/site-packages/diffusers/configuration_utils.py", line 665, in inner_init
init(self, *args, **init_kwargs)
TypeError: SD3ControlNetModel.__init__() got an unexpected keyword argument 'dual_attention_layers'
Describe the bug
The transformer model introduced in SD3 (`transformer_sd3.py`) expects the parameters shown below. Two of these parameters — `dual_attention_layers` and `qk_norm` — remain undefined in the `SD3ControlNetModel` class (`controlnet_sd3.py`).
Reproduction
class SD3ControlNetModel(ModelMixin, ConfigMixin, PeftAdapterMixin, FromOriginalModelMixin):
_supports_gradient_checkpointing = True
Logs
System Info
diffusers: 0.32.0.dev0
Who can help?
@yiyixuxu @sayakpaul @DN6 @asomoza
The text was updated successfully, but these errors were encountered: