[Scheduler] Add support for cosine and wsd scheduler #938
Merged
30 commits
629bd51 [Scheduler] Add support for cosine and wsd scheduler (yzhangcs)
3c120e3 [Misc.] Log learning rate (yzhangcs)
2fc78e2 Unify the three decay lambda fns (yzhangcs)
fce4a14 Remove the default value in function signature (yzhangcs)
29281a6 Update toml configs (yzhangcs)
ed6e1e1 Configurable `lr_decay_ratio` (yzhangcs)
12e83be [Scheduler] Rename `lr_decay_fn` to `linear_warmup_stable_decay` (yzhangcs)
d9b91a5 Delete `lr_decay_type` check in `build_lr_schedulers` (yzhangcs)
bbc82b2 Revert changes on train.py (yzhangcs)
2230d3a [Config] Move scheduler-related params to [scheduler] section (yzhangcs)
01b4b62 Update train.py (yzhangcs)
e246428 Update train.py (yzhangcs)
3a14cf5 Add all scheduler configs in debug config (yzhangcs)
69b05df Add warnings if warmup_stable_steps < warmup_steps (yzhangcs)
827395c Revert changes on train.py (yzhangcs)
f3293ab Obey the code format (yzhangcs)
72a0286 int type warmup_stable_steps (yzhangcs)
2e2b6b4 Rename `training.warmup_steps` to `scheduler.warmup_steps` (yzhangcs)
698d63c Rename `scheduler` to `lr_scheduler` (yzhangcs)
5f742f5 [Legal] Modifications requested by legal for adding additional datase… (lessw2020)
e9fe2e5 [FSDP2][doc] highlight set_requires_gradient_sync and ignored_params … (weifengpy)
f5a9abe [PP] Ensure loss is visible on console for users (#946) (lessw2020)
6d8da38 Make MetricsLogger as a component (#945) (fegin)
f395ed2 [Misc.] Log learning rate (yzhangcs)
af00afb Update train.py (yzhangcs)
65f5f66 Add warnings if warmup_stable_steps < warmup_steps (yzhangcs)
1e61236 Revert changes on train.py (yzhangcs)
6328fc7 Rename `training.warmup_steps` to `scheduler.warmup_steps` (yzhangcs)
f378b2f Merge branch 'main' into main (yzhangcs)
1eb7c71 Fix code formats (yzhangcs)
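For readers who only see the commit titles, the following is a minimal sketch of what a warmup-stable-decay (WSD) learning-rate multiplier with linear or cosine decay might look like. It is not the code merged in this PR. The function name `wsd_lr_multiplier`, its signature, and the way `lr_decay_ratio` is turned into `warmup_stable_steps` are assumptions made for illustration; only the parameter names `warmup_steps`, `warmup_stable_steps`, and `lr_decay_ratio` and the "warn if warmup_stable_steps < warmup_steps" behavior are taken from the commit messages above.

```python
import math
import warnings
from typing import Optional


def wsd_lr_multiplier(
    step: int,
    total_steps: int,
    warmup_steps: int,
    lr_decay_ratio: Optional[float] = None,
    decay_type: str = "linear",
) -> float:
    """Hypothetical helper: factor applied to the base LR at a 0-indexed step.

    Phases: linear warmup -> stable (factor 1.0) -> decay to 0.
    """
    if lr_decay_ratio is None:
        # Assumption: without a ratio there is no stable phase,
        # so decay starts right after warmup.
        warmup_stable_steps = warmup_steps
    else:
        # Assumption: the last `lr_decay_ratio` fraction of training decays.
        warmup_stable_steps = int(total_steps * (1 - lr_decay_ratio))
        if warmup_stable_steps < warmup_steps:
            warnings.warn(
                "warmup_stable_steps < warmup_steps; "
                "starting decay right after warmup instead."
            )
            warmup_stable_steps = warmup_steps

    if step < warmup_steps:
        # Linear warmup from 0 up to the base LR.
        return step / warmup_steps
    if step < warmup_stable_steps:
        # Stable phase: keep the base LR.
        return 1.0

    # Decay phase: progress runs from 0 to 1 over the remaining steps.
    decay_steps = max(1, total_steps - warmup_stable_steps)
    progress = min(1.0, (step - warmup_stable_steps) / decay_steps)
    if decay_type == "linear":
        return 1.0 - progress
    if decay_type == "cosine":
        return 0.5 * (1.0 + math.cos(math.pi * progress))
    raise ValueError(f"unknown decay_type: {decay_type!r}")
```

A function of this shape can be bound with `functools.partial` and passed as `lr_lambda` to `torch.optim.lr_scheduler.LambdaLR`, which multiplies the optimizer's base learning rate by the returned factor at each step.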