
Make manual optimization mandatory for multiple optimizers #16539

Merged (52 commits) on Feb 1, 2023

Conversation

@awaelchli (Member) commented on Jan 28, 2023

What does this PR do?

WARNING: Estimated time to review: 2+ hours 🤣.

This PR reworks how optimization with multiple optimizers is handled.

Closes #12169

Before

The user returns multiple optimizers from the configure_optimizers hook.
The training step gets an optimizer_idx argument and is called once per optimizer with the same batch (see the sketch after the lists below).

Pro:

  • User doesn't have to write the zero_grad and step calls
  • Toggling of optimizers handled automatically

Cons:

  • Users struggle to understand how it works under the hood
  • Stepping optimizers at different intervals is possible, but cumbersome to configure
  • The loop implementation is complex and hard for contributors to follow and debug
  • Non-standard optimization requires the user to override several hooks (backward, optimizer_step, etc.), which increases the surface for bugs (the simplest example is a GAN)
  • Complex output post-processing is required for the batch-end and epoch-end hooks (lists of lists of lists)
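
For illustration, a minimal sketch of the automatic multi-optimizer flow described above, assuming a hypothetical GAN module (the loss helpers are placeholders, not part of any real API):

```python
import torch
import pytorch_lightning as pl


class GAN(pl.LightningModule):
    """Hypothetical GAN illustrating the old automatic-optimization API."""

    def __init__(self, generator, discriminator):
        super().__init__()
        self.generator = generator
        self.discriminator = discriminator

    def training_step(self, batch, batch_idx, optimizer_idx):
        # Called twice per batch: once with optimizer_idx=0, once with optimizer_idx=1.
        # zero_grad(), backward() and step() are handled by the training loop.
        if optimizer_idx == 0:
            return self._generator_loss(batch)  # placeholder helper
        return self._discriminator_loss(batch)  # placeholder helper

    def configure_optimizers(self):
        opt_g = torch.optim.Adam(self.generator.parameters(), lr=2e-4)
        opt_d = torch.optim.Adam(self.discriminator.parameters(), lr=2e-4)
        return [opt_g, opt_d]
```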

Now

Returning multiple optimizers from the configure_optimizers hook will raise an error. You need to set self.automatic_optimization = False and implement manual optimization in your training step (see the sketch after the lists below).

Pro:

  • Non-standard optimization is more intuitive to implement (the simplest example is a GAN)
  • Stepping optimizers at custom intervals is easy to understand and debug because the logic is inlined in the training step by the user
  • Loops become simpler
  • No special outputs processing needed for batch-end and epoch-end hooks

Cons:

  • The user has to call zero_grad, step, and manual_backward themselves.
  • Gradient accumulation is not supported out of the box for manual optimization; you have to accumulate gradients yourself. For the highest flexibility, Fabric should be considered.
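
For comparison, a minimal sketch of the same hypothetical GAN under the new manual-optimization flow (the loss helpers are again placeholders); note that custom step intervals become a plain if statement:

```python
import torch
import pytorch_lightning as pl


class GAN(pl.LightningModule):
    """The same hypothetical GAN, rewritten for manual optimization."""

    def __init__(self, generator, discriminator):
        super().__init__()
        self.generator = generator
        self.discriminator = discriminator
        self.automatic_optimization = False  # required when using multiple optimizers

    def training_step(self, batch, batch_idx):
        opt_g, opt_d = self.optimizers()

        # Generator update on every batch
        g_loss = self._generator_loss(batch)  # placeholder helper
        opt_g.zero_grad()
        self.manual_backward(g_loss)
        opt_g.step()

        # Discriminator update, stepped every other batch to show a custom interval
        if batch_idx % 2 == 0:
            d_loss = self._discriminator_loss(batch)  # placeholder helper
            opt_d.zero_grad()
            self.manual_backward(d_loss)
            opt_d.step()

    def configure_optimizers(self):
        opt_g = torch.optim.Adam(self.generator.parameters(), lr=2e-4)
        opt_d = torch.optim.Adam(self.discriminator.parameters(), lr=2e-4)
        return [opt_g, opt_d]
```

In this mode, gradient accumulation amounts to calling manual_backward on several consecutive batches and only calling step and zero_grad on every N-th batch.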

Does your PR introduce any breaking changes? If yes, please list them.

  • Removed opt_idx argument from BaseFinetuning.finetune_function callback method
  • Removed opt_idx argument from Callback.on_before_optimizer_step callback method
  • Removed optimizer_idx as an optional argument in LightningModule.training_step
  • Removed optimizer_idx argument from LightningModule.on_before_optimizer_step (see the signature sketch after this list)
  • Removed optimizer_idx argument from LightningModule.configure_gradient_clipping
  • Removed optimizer_idx argument from LightningModule.optimizer_step
  • Removed optimizer_idx argument from LightningModule.optimizer_zero_grad
  • Removed optimizer_idx argument from LightningModule.lr_scheduler_step
  • Removed support for declaring optimizer frequencies in the dictionary returned from LightningModule.configure_optimizers
  • Removed arguments optimizer and optimizer_idx from LightningModule.backward
  • Removed optimizer_idx argument from PrecisionPlugin.{optimizer_step,backward} and all of its overrides in subclasses
  • Removed optimizer_idx argument from Strategy.{optimizer_step,backward} and all of its overrides in subclasses
  • Removed Trainer.optimizer_frequencies attribute
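
As an example of how existing overrides change, a hook such as LightningModule.on_before_optimizer_step simply loses the index argument. A sketch (the gradient-norm logging body is illustrative, not part of this PR):

```python
import torch
import pytorch_lightning as pl


class MyModule(pl.LightningModule):
    # Before this PR: def on_before_optimizer_step(self, optimizer, optimizer_idx): ...
    # After this PR, the index parameter is gone:
    def on_before_optimizer_step(self, optimizer):
        # Illustrative body: log the total gradient norm right before the step.
        grads = [p.grad.norm() for p in self.parameters() if p.grad is not None]
        if grads:
            self.log("grad_norm", torch.stack(grads).norm())
```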

Follow ups to this PR:

  • Documentation updates
  • Demote optimizer loop class from being a loop

Before submitting

  • Was this discussed/approved via a GitHub issue? (not for typos and docs)
  • Did you read the contributor guideline, Pull Request section?
  • Did you make sure your PR does only one thing, instead of bundling different changes together?
  • Did you make sure to update the documentation with your changes? (if necessary)
  • Did you write any new necessary tests? (not for typos and docs)
  • Did you verify new and existing tests pass locally with your changes?
  • Did you update the CHANGELOG? (not for typos, docs, test updates, or internal minor changes/refactorings)

PR review

Anyone in the community is free to review the PR once the tests have passed.
Before you start reviewing, make sure you have read the Review guidelines. In short, check the following bullet list:

  • Is this pull request ready for review? (if not, please submit in draft mode)
  • Check that all items from Before submitting are resolved
  • Make sure the title is self-explanatory and the description concisely explains the PR
  • Add labels and milestones (and optionally projects) to the PR so it can be classified

Did you have fun?

I made sure I had fun coding 🙃

cc @justusschock @awaelchli @Borda @carmocca

github-actions bot added the pl (Generic label for PyTorch Lightning package) label on Jan 28, 2023
awaelchli self-assigned this on Jan 30, 2023
awaelchli changed the title from "WIP: Make manual optimization mandatory for multiple optimizers" to "Make manual optimization mandatory for multiple optimizers" on Jan 30, 2023
@carmocca (Contributor) left a comment:

LGTM. Huge effort!

mergify bot added the ready (PRs ready to be merged) label on Jan 31, 2023
mergify bot added the has conflicts label and removed the ready label on Feb 1, 2023
@Borda (Member) left a comment:

Quite long, but overall looks good 👍

mergify bot added the ready label and removed the has conflicts label on Feb 1, 2023
Labels
  • breaking change (Includes a breaking change)
  • loops (Related to the Loop API)
  • optimization
  • pl (Generic label for PyTorch Lightning package)
  • ready (PRs ready to be merged)
  • refactor
Projects
None yet
Development

Successfully merging this pull request may close these issues:

  • lr scheduler doesn't follow optimizer's frequency [in an edge case]
4 participants