Make preemptions load last found state #1025

Merged
merged 4 commits into from
Feb 24, 2025

Conversation

rayg1234
Contributor

@rayg1234 rayg1234 commented Feb 22, 2025

With submitit, preemption checkpointing calls the checkpoint callback only on the main rank, which is not sufficient for DCP checkpointing. So instead, on preemption we'll just try to find the last checkpoint that was saved and load state from it.
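
The "find the last checkpoint" step can be sketched roughly as follows. This is an illustration only, not the code merged in this PR: the function name `find_latest_checkpoint` and the `step_<N>` directory naming are assumptions made up for the example, and the actual fairchem layout may differ.

```python
import re
from pathlib import Path
from typing import Optional

# Hypothetical naming scheme: one subdirectory per saved step, e.g. "step_100".
_STEP_DIR = re.compile(r"^step_(\d+)$")


def find_latest_checkpoint(checkpoint_root) -> Optional[Path]:
    """Return the checkpoint subdirectory with the highest step number.

    Returns None if the root does not exist or holds no matching directory,
    in which case the caller would start from scratch.
    """
    root = Path(checkpoint_root)
    if not root.is_dir():
        return None

    best: Optional[Path] = None
    best_step = -1
    for child in root.iterdir():
        match = _STEP_DIR.match(child.name)
        if match is None or not child.is_dir():
            continue  # skip unrelated files and directories
        step = int(match.group(1))  # compare numerically, not lexically
        if step > best_step:
            best_step, best = step, child
    return best
```

Because every rank can run the same lookup against a shared checkpoint directory, a scheme like this does not depend on a callback that fires only on the main rank.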


codecov bot commented Feb 22, 2025

Codecov Report

Attention: Patch coverage is 25.00000% with 6 lines in your changes missing coverage. Please review.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| src/fairchem/core/_cli_hydra.py | 0.00% | 4 Missing ⚠️ |
| src/fairchem/core/common/utils.py | 50.00% | 2 Missing ⚠️ |

| Files with missing lines | Coverage Δ |
|---|---|
| src/fairchem/core/common/utils.py | 67.51% `<50.00%>` (-0.10%) ⬇️ |
| src/fairchem/core/_cli_hydra.py | 83.23% `<0.00%>` (-0.49%) ⬇️ |

@rayg1234 rayg1234 added the enhancement New feature or request label Feb 22, 2025
@rayg1234 rayg1234 added the minor Minor version release label Feb 23, 2025
@rayg1234 rayg1234 requested a review from misko February 24, 2025 04:54
@rayg1234 rayg1234 added this pull request to the merge queue Feb 24, 2025
Merged via the queue into main with commit d07fd0e Feb 24, 2025
8 of 9 checks passed
Labels
enhancement New feature or request minor Minor version release
2 participants