Enable inference mode for testing and predicting #8813
Conversation
Codecov Report

```
@@            Coverage Diff            @@
##           master    #8813    +/-   ##
=========================================
- Coverage      92%      88%      -4%
=========================================
  Files         179      179
  Lines       14910    14908       -2
=========================================
- Hits        13760    13145     -615
- Misses       1150     1763     +613
```
@tangbinh thanks for working on this! It'd be awesome to include information on performance speedups for some candidate models as well.
Should we allow users to opt out of inference mode if they want to? I can imagine cases where gradients are needed at test time (e.g. higher-order derivatives), so being able to turn it off with a flag might be important. I'm thinking of having an opt-out flag for this.
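For context, a minimal sketch of what such an opt-out could look like. The `inference_mode` parameter and the `evaluation_context` helper below are hypothetical illustrations, not the actual Lightning API:

```python
import torch


def evaluation_context(inference_mode: bool = True):
    # Hypothetical helper (not the real Lightning API): pick which
    # autograd-disabling context the evaluation loop runs under.
    # torch.inference_mode() is the faster option, but its outputs can
    # never be used with autograd later; torch.no_grad() keeps the old
    # behaviour and can be locally overridden with torch.enable_grad().
    return torch.inference_mode() if inference_mode else torch.no_grad()


model = torch.nn.Linear(4, 2)

# A user who needs test-time gradients (e.g. higher-order derivatives)
# would opt out and fall back to no_grad:
with evaluation_context(inference_mode=False):
    out = model(torch.randn(8, 4))
```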
Do you have suggestions for the candidate models and some datasets to test them on? I was running some simple tasks and didn't notice much of a difference.
@Borda do you recommend the models used for the parity benchmark?
For the 2 cases where `no_grad` has already been replaced, we don't need to check the `TrainerFn` because, by design, they will only be called during `trainer.validate()`, `trainer.test()`, or `trainer.predict()`. There, no grad operations should take place, so we are safe. This is different from the following, which is the validation part of `trainer.fit()`. This is where I'm not sure whether `no_grad` should be kept:
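To make the distinction concrete, here is a small standalone illustration (not from the PR) of why the fit-time validation case is different: `no_grad` can be locally overridden with `enable_grad`, while `inference_mode` produces tensors that can never re-enter autograd.

```python
import torch

x = torch.ones(3)

with torch.no_grad():
    x.requires_grad_()           # allowed: just flips the leaf's flag
    with torch.enable_grad():    # locally re-enables grad tracking
        y = (x * 2).sum()

y.backward()                     # works: y recorded a graph under enable_grad
print(x.grad)                    # tensor([2., 2., 2.])

with torch.inference_mode():
    z = x * 2                    # z is an "inference tensor"

# z.requires_grad_()  # RuntimeError: inference tensors can't enter autograd
```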
@ananthsub @tchaton @awaelchli @carmocca Please see this notebook for some performance tests. It looks like for some common models and datasets, enabling inference mode doesn't make much of a difference in terms of inference time. The observation is consistent on both GPU and CPU. That said, I think it doesn't hurt if we decide to move forward with `torch.inference_mode`.
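For anyone who wants to reproduce such a comparison, a rough CPU-only timing sketch along these lines (`resnet18` is an arbitrary stand-in, not necessarily one of the models from the notebook; on GPU you would also need `torch.cuda.synchronize()` around the timers):

```python
import time

import torch
import torchvision.models as models  # assumes torchvision is installed

model = models.resnet18().eval()
batch = torch.randn(32, 3, 224, 224)


def bench(ctx, iters=10):
    # Time repeated forward passes under the given autograd context.
    with ctx():
        model(batch)  # warm-up pass
        start = time.perf_counter()
        for _ in range(iters):
            model(batch)
        return (time.perf_counter() - start) / iters


print(f"no_grad:        {bench(torch.no_grad):.4f}s / batch")
print(f"inference_mode: {bench(torch.inference_mode):.4f}s / batch")
```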
Thanks for working on this @tangbinh!
Dear @tangbinh @ananthsub, could we get benchmarks for some traditional models to validate that this doesn't introduce a performance regression in Lightning, plus updated docs? Best,
Would you mind proposing some of the traditional models you have in mind? I've done some benchmark tests, although they don't follow previous examples, as I couldn't find a good one.
What does this PR do?
Does your PR introduce any breaking changes? If yes, please list them.
None
Before submitting
PR review
Anyone in the community is welcome to review the PR.
Before you start reviewing, make sure you have read the review guidelines. In short, see the following bullet list:
Did you have fun?
Make sure you had fun coding 🙃