Fix squared optimization steps bug in distillation trainer #284

Merged
tomaarsen merged 1 commit into huggingface:main from tomaarsen:squared_steps_distillation
Jan 23, 2023
@tomaarsen
Member

Hello!

Pull Request overview

Details

The train method of the distillation trainer is very similar to the train method of the regular trainer. As a result, it also contained the bug from #280, and the corresponding fix could be copied over without any issues. For additional details, see that PR or #268 (comment), which originally showcased the bug.

  • Tom Aarsen

@tomaarsen tomaarsen added the bug Something isn't working label Jan 23, 2023
@tomaarsen tomaarsen merged commit 0cb8ffd into huggingface:main Jan 23, 2023
@tomaarsen tomaarsen deleted the squared_steps_distillation branch January 23, 2023 12:00