
Conversation

@sayakpaul (Member)

What does this PR do?

Follow-up of: #6483

@sayakpaul sayakpaul changed the title [Training] make checkpointing compatible when using torch.compile. [Training] make checkpointing compatible when using torch.compile (part II) Jan 10, 2024
```diff
 accelerator.wait_for_everyone()
 if accelerator.is_main_process:
-    unet = accelerator.unwrap_model(unet)
+    unet = unwrap_model(unet)
```
@sayakpaul (Member Author)

For intermediate places (such as running validation inference) where we do use accelerator.unwrap_model(), it's not an issue because the models are used directly. But here, since we're obtaining the state dicts, we need to pull the _orig_mod out in case torch.compile() was called. LMK if it's not clear.
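
For illustration, here's a minimal sketch of what such a helper can look like. This is a hypothetical standalone version, not necessarily the exact helper from this PR; it assumes an Accelerate `accelerator` object is available:

```python
from accelerate import Accelerator

accelerator = Accelerator()

def unwrap_model(model):
    # Strip whatever wrapper Accelerate added (e.g. DistributedDataParallel).
    model = accelerator.unwrap_model(model)
    # torch.compile() wraps a module in an OptimizedModule that keeps the
    # original module on `_orig_mod`; taking the state dict from the wrapper
    # prefixes every key with `_orig_mod.`, which breaks checkpoint loading.
    # Unwrap it before any state_dict() call.
    if hasattr(model, "_orig_mod"):
        model = model._orig_mod
    return model
```

With a helper like this, `unwrap_model(unet).state_dict()` yields the same keys whether or not the script was launched with torch.compile() enabled.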

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@patrickvonplaten (Contributor) left a comment

Nice!

@sayakpaul sayakpaul merged commit be0b425 into main Jan 11, 2024
@sayakpaul sayakpaul deleted the torch-compile-compatible-training-ii branch January 11, 2024 13:08
AmericanPresidentJimmyCarter pushed a commit to AmericanPresidentJimmyCarter/diffusers that referenced this pull request Apr 26, 2024
[Training] make checkpointing compatible when using torch.compile (part II) (huggingface#6511)

make checkpointing compatible when using torch.compile.