[Sd3 Dreambooth LoRA] Add text encoder training for the clip encoders #8630
Conversation
When no individual prompts are provided, only instance prompts, I got this:
Thanks @r-aristov! I think it should be working now.
```python
if text_encoder_lora_layers:
    state_dict.update(pack_weights(text_encoder_lora_layers, "text_encoder"))

if text_encoder_2_lora_layers:
    state_dict.update(pack_weights(text_encoder_2_lora_layers, "text_encoder_2"))
```
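For reference, `pack_weights` just namespaces the LoRA parameters so the transformer and both text encoder LoRAs can share one flat state dict. A minimal sketch of what it does, matching the snippet above (not necessarily the exact helper in the library):

```python
import torch


def pack_weights(layers, prefix):
    # Accept either a module (take its state_dict) or an already-flat dict of tensors.
    layers_weights = layers.state_dict() if isinstance(layers, torch.nn.Module) else layers
    # Prefix every key, e.g. "text_encoder.<module>.lora_A.weight", so all
    # components can be saved to and loaded from a single checkpoint file.
    return {f"{prefix}.{name}": param for name, param in layers_weights.items()}
```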
Can we confirm via experiments that text encoder 3 training doesn't matter too much? This can be done separately and won't block this PR.
Actually, we skipped it not because we think it doesn't matter, but because we thought training the T5 would be a different animal from the already well-known CLIP text encoder training (also on the VRAM consumption side). So indeed we left investigating T5 training to a future PR!
I'm slightly worried about the dynamics of this, so let's make sure we run ample experiments to see whether training two text encoders while keeping the third one fixed works as expected.
sayakpaul left a comment:
Thanks. I have left some comments. My main question is: how much does training the text encoders matter here, given we use three in SD3? Could we see some concrete comparative examples?
Additionally, we need to add tests to https://github.com/huggingface/diffusers/blob/main/tests/lora/test_lora_layers_sd3.py and add a note about --train_text_encoder to the README.
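For such a comparison, a minimal inference sketch along these lines should do; the model ID is the public SD3 medium checkpoint, and the LoRA path is a placeholder for the training output directory:

```python
import torch
from diffusers import StableDiffusion3Pipeline

pipe = StableDiffusion3Pipeline.from_pretrained(
    "stabilityai/stable-diffusion-3-medium-diffusers", torch_dtype=torch.float16
).to("cuda")

prompt = "a photo of sks dog in a bucket"
baseline = pipe(prompt, generator=torch.manual_seed(0)).images[0]

# With --train_text_encoder, the saved checkpoint also carries LoRA layers for the
# two CLIP encoders; load_lora_weights applies them alongside the transformer LoRA.
pipe.load_lora_weights("path/to/trained-sd3-lora")  # placeholder path
with_lora = pipe(prompt, generator=torch.manual_seed(0)).images[0]
```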
Cool, the results are stunning. So, the TODOs are:
sayakpaul left a comment:
Excellent work!
…#8630)

* add clip text-encoder training
* no dora
* text encoder training fixes
* text encoder training fixes
* text encoder training fixes
* text encoder training fixes
* text encoder training fixes
* text encoder training fixes
* add text_encoder layers to save_lora
* style
* fix imports
* style
* fix text encoder
* review changes
* review changes
* review changes
* minor change
* add lora tag
* style
* add readme notes
* add tests for clip encoders
* style
* typo
* fixes
* style
* Update tests/lora/test_lora_layers_sd3.py (Co-authored-by: Sayak Paul <[email protected]>)
* Update examples/dreambooth/README_sd3.md (Co-authored-by: Sayak Paul <[email protected]>)
* minor readme change

Co-authored-by: YiYi Xu <[email protected]>
Co-authored-by: Sayak Paul <[email protected]>

Add text encoder training support for the CLIP encoders to the DreamBooth LoRA training script for SD3.
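In outline, when `--train_text_encoder` is passed, the script attaches LoRA adapters to the two CLIP encoders via PEFT. A minimal sketch under assumed defaults (rank 4 standing in for the script's `--rank`, the public SD3 medium checkpoint, and illustrative variable names):

```python
from peft import LoraConfig
from transformers import CLIPTextModelWithProjection

# Load SD3's two CLIP text encoders (subfolder names per the official repo).
model_id = "stabilityai/stable-diffusion-3-medium-diffusers"
text_encoder_one = CLIPTextModelWithProjection.from_pretrained(model_id, subfolder="text_encoder")
text_encoder_two = CLIPTextModelWithProjection.from_pretrained(model_id, subfolder="text_encoder_2")

# Add LoRA to the attention projections of both encoders.
text_lora_config = LoraConfig(
    r=4,
    lora_alpha=4,
    init_lora_weights="gaussian",
    target_modules=["q_proj", "k_proj", "v_proj", "out_proj"],
)
text_encoder_one.add_adapter(text_lora_config)
text_encoder_two.add_adapter(text_lora_config)
```

The T5 encoder (`text_encoder_3`) is intentionally left frozen, as discussed above.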