-
Notifications
You must be signed in to change notification settings - Fork 3.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[1/4] Add get_device_stats to accelerator interface #9586
[1/4] Add get_device_stats to accelerator interface #9586
Conversation
Codecov Report
@@ Coverage Diff @@
## master #9586 +/- ##
=======================================
- Coverage 93% 89% -4%
=======================================
Files 181 179 -2
Lines 15349 15350 +1
=======================================
- Hits 14246 13591 -655
- Misses 1103 1759 +656 |
for more information, see https://pre-commit.ci
…ightning into get_device_stats
for more information, see https://pre-commit.ci
…ightning into get_device_stats
for more information, see https://pre-commit.ci
…ightning into get_device_stats
for more information, see https://pre-commit.ci
Hey @daniellepintz, thanks for linking all the following PRs :) noob question for PR 2/4. Should we differentiate Best, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@daniellepintz it'd be good to also add tests for the GPU and TPU accelerator to confirm that get_device_stats returns something
@tchaton this was modeled after GPUStatsMonitor where we log in both places. I myself am not sure what is best so please lmk if you think I should change this |
@ananthsub I believe any tests I would write would be similar to the ones in https://github.com/daniellepintz/pytorch-lightning/pull/2/files, or did you have something else in mind? |
…ightning into get_device_stats
the tests there are end to end tests relying on multiple components: the accelerator, callback, and trainer. the accelerator tests would test similar functionality but should be more self-contained as a unit test specifically for having the tests here & in the callback will be helpful to detect if something breaks in one place vs another |
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
…ightning into get_device_stats
Looking at the output of the CI TPU tests I see two odd things:
Does anyone have any insight into these issues? |
Hey @kaushikb11 Mind looking into @daniellepintz TPU testing ? |
What does this PR do?
Adds
get_device_stats
function to accelerator class, and implements it for GPU and TPU. To be called in later PRsTested with daniellepintz#2
Part of #9032
Please also see subsequent PRs:
daniellepintz#2
daniellepintz#3
daniellepintz#4
Does your PR introduce any breaking changes? If yes, please list them.
Before submitting
PR review
Anyone in the community is welcome to review the PR.
Before you start reviewing make sure you have read Review guidelines. In short, see the following bullet-list:
Did you have fun?
Make sure you had fun coding 🙃