Hello. I want to train this model on a new language. I want to know what structure the dataset should have for this model.
I'm wondering about this too. Maybe plain .wav files are enough, but I haven't confirmed that yet.
@RafaelJCruz whisper (especially the default `medium` model) is not perfect for transcription.
@snmahsa a standard dataset format such as LJSpeech might be sufficient.
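For reference, the LJSpeech layout is a `wavs/` folder of audio clips plus a pipe-delimited `metadata.csv` with one line per clip: `id|transcription|normalized transcription`, with no header row. Below is a minimal sketch of building that file. The function name `write_ljspeech_metadata` and the choice to repeat the raw text in the normalized column are my own assumptions for illustration, and whether this model's training scripts accept the layout unchanged has not been confirmed in this thread.

```python
from pathlib import Path


def write_ljspeech_metadata(dataset_dir, transcripts):
    """Write an LJSpeech-style metadata.csv for the clips in dataset_dir/wavs.

    `transcripts` maps a clip ID (the .wav file name without its extension)
    to the text spoken in that clip. Hypothetical helper, not part of any
    library.
    """
    root = Path(dataset_dir)
    wav_dir = root / "wavs"
    rows = []
    for clip_id, text in sorted(transcripts.items()):
        # Every metadata row must point at an existing audio file.
        if not (wav_dir / f"{clip_id}.wav").is_file():
            raise FileNotFoundError(f"missing audio for clip {clip_id!r}")
        # The pipe is the field separator, so it cannot appear in the text.
        if "|" in text or "\n" in text:
            raise ValueError(f"transcript for {clip_id!r} contains '|' or a newline")
        # Assumption: no separate normalization step, so the raw text is
        # reused for the third (normalized) column.
        rows.append(f"{clip_id}|{text}|{text}")
    metadata_path = root / "metadata.csv"
    metadata_path.write_text("\n".join(rows) + "\n", encoding="utf-8")
    return metadata_path
```

Calling `write_ljspeech_metadata("my_dataset", {"clip_0001": "Hello world."})` expects `my_dataset/wavs/clip_0001.wav` to exist and writes `my_dataset/metadata.csv`. If the transcripts come from an automatic transcriber, it is worth proofreading them before training.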
In case you, like me, were looking for the dataset size they used for their non-English languages (e.g. Japanese), see #96 (comment).