Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support for large-v2 and large-v3 missing #801

Closed
faspie opened this issue Jun 17, 2024 · 4 comments
Closed

Support for large-v2 and large-v3 missing #801

faspie opened this issue Jun 17, 2024 · 4 comments

Comments

@faspie
Copy link

faspie commented Jun 17, 2024

Pleas add support for the models "large-v2" an "large-v3"

@raivisdejus
Copy link
Collaborator

Faster whisper internally uses large-v2, that is a note.

To your mind is there any reason to keep "large" or does it make sense to switch to "large-v2" and "large-v3"?

In my experience "large-v3" can have more hallucinations than "large-v2".

@faspie
Copy link
Author

faspie commented Jun 18, 2024

I have never used tiny, small or large yet but I am using whisper with a huge server and a Tesla P40. On a standard configuration I would probably use large...

I am using large-v2 and large-v3. With German language large-v3 seems to be more resilient in setting with much ambient noise but has indeed more hallucinations e. g. complex sentences or breaks lead to continous repeat of a sentence. Sometimes there are "free" hallucinations which have absolutely nothing to do with the record.

I would suggest to implement support for large, large-v2 and large-v3

@faspie
Copy link
Author

faspie commented Jun 18, 2024

Perhaps, for unexperienced users, you could mark some settings as "recommended"

@faspie
Copy link
Author

faspie commented Jun 21, 2024

You are great! Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants