-
Notifications
You must be signed in to change notification settings - Fork 31.9k
Fix whisper kwargs and generation config #30018
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update. |
gante
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, thank you for fixing 💛
(I am surprised that no failures were raised in the previous PR's CI 👀 )
|
likely test fetcher didn't find it ... |
|
well, i see , whisper is not in IMPORTANT_MODELS = [ should we put it ... 😄 @sanchit-gandhi @ArthurZucker ? |
sanchit-gandhi
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Awesome - thanks for the fix @zucchini-nlp! Requesting final review from @ArthurZucker
|
I would be in favour of adding Whisper and possible Wav2Vec2 as well to this list as the two most used audio models @ydshieh |
|
@sanchit-gandhi |
ArthurZucker
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could you resolve merge conflicts since #29225 was merged!
|
Done! |
ArthurZucker
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks
|
@zucchini-nlp Do you have the merge permission (i.e. click the |
|
yep, merged! |
What does this PR do?
Whisper code was changed recently to use
prepare_generation_configfrom the GenerationMixin here, which started causing some bugs. See failing test in this PR.Before, Whisper generation config did not do any
generation_config.updateand manually updated kwargs along the way. After the above change, kwargs cannot be used anymore because it contains unused kwargs from generation config.This PR replaces everywhere we had
kwargs.pop()withgeneraion_config. All tests are passing, except for some slow tests. But the slow tests are not passing inmainalso, so should be unrelated.