-
Notifications
You must be signed in to change notification settings - Fork 8.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
There is no speaker id in my dataset, what should I do? #861
Comments
Familiarize yourself with the LibriSpeech dataset format. Then reformat your dataset to resemble LibriSpeech. This requires you to separate your audio files by speaker. Hopefully you are provided enough information to do this. If not, you will need a different dataset. |
Unfortunately, no... I will try to teach the encoder to LibriSpeech, LibriTTS, voxforge(1,2) and mozilla common voice(Russian, Belarusian, English) |
Should I teach the model in more than two languages? Will it affect the quality? |
If the symbols in your languages are mutually exclusive, it should be possible. However, I wouldn't recommend it until you have a lot of experience with training models. |
By model I mean encoder |
It was attempted in #126. Although the encoder model performed very well for speaker verification, the results for voice cloning were not good in that one instance. |
No description provided.
The text was updated successfully, but these errors were encountered: