generated from fastai/nbdev_template
-
Notifications
You must be signed in to change notification settings - Fork 2.5k
small KTO fixes #1734
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
small KTO fixes #1734
Changes from all commits
Commits
Show all changes
50 commits
Select commit
Hold shift + click to select a range
6ee3be4
add warning for imbalanced data
kawine 22dd810
update documentation
kawine 8d14930
update script commands to be same as in dpo
kawine 8a490af
use batch_size KL examples and batch_size target examples to calculat…
kawine f826600
fix deepspeed issue
kawine 688ed6c
speed up forward with no_grad for KL
kawine 587517b
Merge branch 'huggingface:main' into main
kawine e128f09
add some removed metrics
kawine 2d860b8
Update trl/trainer/kto_trainer.py
kashif 48d25ff
Update trl/trainer/kto_trainer.py
kashif 392bcc0
Update trl/trainer/kto_trainer.py
kawine a42049f
Update trl/trainer/kto_trainer.py
kawine 5696814
Update trl/trainer/kto_trainer.py
kawine 000d5d8
Update trl/trainer/kto_trainer.py
kawine 2738d1f
Update trl/trainer/kto_trainer.py
kawine d7f63c5
Update trl/trainer/kto_trainer.py
kawine 824da55
Update trl/trainer/kto_trainer.py
kawine 4399af4
Update trl/trainer/kto_trainer.py
kawine 69094be
Update trl/trainer/kto_trainer.py
kawine 73f7ed7
Update trl/trainer/kto_trainer.py
kawine 5b95aca
Update trl/trainer/kto_trainer.py
kawine 3102901
Update trl/trainer/kto_trainer.py
kawine ca68f24
add more detailed comments
kawine 94fb375
convert assert to ValueError
kawine 8f7e788
Update kto_trainer.py
kawine ed19ed5
precommit formatting
kawine 310bd97
Merge branch 'main' of https://github.com/kawine/trl into main
kawine 639f4de
Merge branch 'huggingface:main' into main
kawine ee7d6a4
remove nans in metrics by gathering across machines
kawine 7ae95c2
fix formatting
kawine 1b96b2d
fix choice of mismatched examples for KL term
kawine 81b60da
describe weights
kawine 1f145b9
fix hanging issue in distributed training
kawine 83ed882
linting
kawine 9c5480d
Merge branch 'main' of https://github.com/kawine/trl into main
kawine 15251ff
move metrics to cpu
kawine 8f9fdfe
Update trl/trainer/kto_trainer.py
kawine 600aad8
Update trl/trainer/kto_trainer.py
kashif 8b5367e
Update trl/trainer/kto_trainer.py
kashif 5cc6fed
Merge branch 'huggingface:main' into main
kawine 03dfe90
Merge branch 'huggingface:main' into main
kawine cf6217e
Merge branch 'main' of https://github.com/kawine/trl into main
xwinxu e3a3691
Merge branch 'huggingface:main' into main
xwinxu e88b5ed
Merge branch 'huggingface:main' into main
kawine 12f99b9
remove kto_pair
kawine 2424ef0
resolve conflicts
kawine af7c424
speed up data processing
kawine 95f361b
move bco code inside
kawine 4c234a3
raise error for kto_pair argument
kawine e88de4c
fix formatting
kawine File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.