-
Notifications
You must be signed in to change notification settings - Fork 31.9k
[Deepspeed] Allow HF optimizer and scheduler to be passed to deepspeed #10464
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
+276
−121
Merged
Changes from all commits
Commits
Show all changes
32 commits
Select commit
Hold shift + click to select a range
0541df1
pass hf optimizer and scheduler to deepspeed if not specified in ds c…
cli99 30ebb6f
pass hf optimizer and scheduler to deepspeed if not specified in ds c…
cli99 8416a78
Merge branch 'deepspeed' of https://github.com/cli99/transformers int…
cli99 aec38cb
update
cli99 1ed68e1
make init_deepspeed support config dict
stas00 98a1562
fix docstring formatting
stas00 333d8dc
clean up trainer's comments
stas00 9daef95
add new tests
stas00 c0060e9
fix type
stas00 14cdc4b
composit argparse doesn't work
stas00 83e4897
style
stas00 9c73ce3
add a new test, rename others
stas00 1aeb2f2
document new functionality
stas00 4cc0679
Merge remote-tracking branch 'origin/master' into deepspeed
stas00 e78f40e
complete tests, add docs
stas00 605358d
style
stas00 a17c77a
correct level
stas00 c5f06b6
Apply suggestions from code review
stas00 f6d0067
add new methods to the doc
stas00 bb448d6
Merge remote-tracking branch 'origin/master' into deepspeed
stas00 20f395c
must tell DS we are using a non-native optimizer
stas00 8e20811
add protection against cpu_offload + HF optimizer combo
stas00 a2d877d
fix the cli overrides
stas00 e4abec8
sync docs + tests
stas00 dccb770
restore AdamW
stas00 eb4051f
better docs
stas00 3b09360
need new version
stas00 a354f42
no longer needed
stas00 da2fe96
remove outdate information
stas00 dfb0d57
refactor duplicated code
stas00 e758a3e
Merge remote-tracking branch 'origin/master' into deepspeed
stas00 fb84a93
þMerge branch 'master' into deepspeed
cli99 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.