Add MpModelWrapper in TPU Spawn #7045

kaushikb11 · 2021-04-15T18:22:03Z

What does this PR do?

Add MpModelWrapper in TPU Spawn.

Before submitting

Was this discussed/approved via a GitHub issue? (not for typos and docs)
Did you read the contributor guideline, Pull Request section?
Did you make sure your PR does only one thing, instead of bundling different changes together?
Did you make sure to update the documentation with your changes? (if necessary)
Did you write any new necessary tests? (not for typos and docs)
Did you verify new and existing tests pass locally with your changes?
Did you update the CHANGELOG? (not for typos, docs, test updates, or internal minor changes/refactorings)

PR review

Anyone in the community is free to review the PR once the tests have passed.
Before you start reviewing, make sure you have read the Review guidelines. In short, check the following:

Is this pull request ready for review? (if not, please submit in draft mode)
Check that all items from Before submitting are resolved
Make sure the title is self-explanatory and the description concisely explains the PR
Add labels and milestones (and optionally projects) to the PR so it can be classified

Did you have fun?

Make sure you had fun coding 🙃

pep8speaks · 2021-04-15T18:22:08Z

Hello @kaushikb11! Thanks for updating this PR.

There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻

Comment last updated at 2021-04-20 12:04:48 UTC

kaushikb11 · 2021-04-15T22:26:14Z

Converted to draft, as the TPU tests were failing.

tchaton · 2021-04-19T14:34:30Z

pytorch_lightning/plugins/training_type/tpu_spawn.py

@@ -257,16 +260,16 @@ def start_predicting(self, trainer) -> None:
        xmp.spawn(self.new_process, **self.xmp_spawn_kwargs)

    def training_step(self, *args, **kwargs):
-        return self.lightning_module.training_step(*args, **kwargs)
+        return self.model(*args, **kwargs)


This should be wrapped_model.

kaushikb11 · 2021-04-19

The wrapped model is assigned while moving the model to the device: https://github.com/PyTorchLightning/pytorch-lightning/blob/tpu_spawn_added/pytorch_lightning/plugins/training_type/tpu_spawn.py#L172

MpModelWrapper just uses an mp lock and returns the model: https://github.com/pytorch/xla/blob/master/torch_xla/distributed/xla_multiprocessing.py#L432
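
For readers unfamiliar with the pattern discussed in this thread, here is a minimal sketch of a wrapper that takes a lock, moves the model to a device, and returns the underlying model rather than itself. It is not the torch_xla implementation: `LockedModelWrapper` and `ToyModel` are hypothetical names, the lock type is an assumption, and the device string is only a placeholder, so nothing here requires a TPU or torch_xla.

```python
import threading


class LockedModelWrapper:
    """Hypothetical stand-in for the wrapper pattern (not torch_xla code)."""

    def __init__(self, model):
        self._model = model
        # Assumption: a plain threading lock is enough for this illustration.
        self._lock = threading.Lock()

    def to(self, device):
        # Serialize the device transfer, then hand back the underlying model.
        with self._lock:
            self._model.to(device)
        return self._model


class ToyModel:
    """Minimal object with a .to() method, standing in for an nn.Module."""

    def __init__(self):
        self.device = "cpu"

    def to(self, device):
        self.device = device
        return self

    def __call__(self, x):
        return x * 2


wrapper = LockedModelWrapper(ToyModel())
# .to() returns the model itself, so later calls such as
# self.model(*args, **kwargs) go straight to the model, not the wrapper.
model = wrapper.to("xla:0")  # placeholder device string
result = model(21)
```

Because `to` returns the wrapped model, code that stores the return value ends up holding an ordinary model object, which matches the reply above that the wrapper is assigned while moving the model to the device.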

tchaton added 2 commits April 15, 2021 23:48

update

77a59e6

update

2bc7ea2

kaushikb11 requested review from awaelchli, justusschock, SeanNaren, tchaton and williamFalcon as code owners April 15, 2021 18:22

kaushikb11 self-assigned this Apr 15, 2021

kaushikb11 added this to the 1.2.x milestone Apr 15, 2021

kaushikb11 added the bug, feature, accelerator: tpu, and priority: 0 labels Apr 15, 2021

kaushikb11 added 2 commits April 16, 2021 00:05

fix flake8

c78fdd2

Update changelog

9a6b650

kaushikb11 requested review from Borda and carmocca as code owners April 15, 2021 18:40

tchaton approved these changes Apr 15, 2021

tchaton enabled auto-merge (squash) April 15, 2021 20:36

Some non-exhaustive typing fixes

11111e0

carmocca approved these changes Apr 15, 2021

pytorch_lightning/plugins/training_type/tpu_spawn.py (outdated, resolved)

carmocca added 2 commits April 15, 2021 23:17

Fix typing kwargs

9ecd785

Typing

b2ff89a

awaelchli approved these changes Apr 15, 2021

awaelchli reviewed Apr 15, 2021

pytorch_lightning/plugins/training_type/tpu_spawn.py (resolved)

carmocca added 2 commits April 15, 2021 23:45

Fix circular import

58fdba6

Update pytorch_lightning/plugins/training_type/tpu_spawn.py

3bc7083

carmocca disabled auto-merge April 15, 2021 21:47

kaushikb11 marked this pull request as draft April 15, 2021 22:22

Borda modified the milestones: 1.2.x, 1.3 Apr 18, 2021

kaushikb11 and others added 2 commits April 19, 2021 13:38

Merge branch 'master' into tpu_spawn_added

8230691

Update wrapped model logic

a0d1f6b

tchaton reviewed Apr 19, 2021

Update weight sharing test

c983709

kaushikb11 marked this pull request as ready for review April 20, 2021 12:23

kaushikb11 enabled auto-merge (squash) April 20, 2021 12:32

kaushikb11 merged commit f168a53 into master Apr 20, 2021

kaushikb11 deleted the tpu_spawn_added branch April 20, 2021 13:05
