There is no speaker id in my dataset, what should I do? #861

Closed
fancat-programer opened this issue Oct 5, 2021 · 6 comments

Comments

@fancat-programer

No description provided.

@ghost

ghost commented Oct 5, 2021

Familiarize yourself with the LibriSpeech dataset format. Then reformat your dataset to resemble LibriSpeech. This requires you to separate your audio files by speaker.

Hopefully you are provided enough information to do this. If not, you will need a different dataset.
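
As a rough sketch of what "separate your audio files by speaker" can look like: LibriSpeech stores audio as `<speaker_id>/<chapter_id>/<speaker_id>-<chapter_id>-<utterance_id>.flac`. The helper below copies files into a similar layout. The function name `arrange_by_speaker`, the `file_to_speaker` mapping, and the single placeholder chapter `"0"` are assumptions made for illustration; the mapping itself has to come from your dataset's metadata and cannot be recovered from this script.

```python
import shutil
from collections import defaultdict
from pathlib import Path


def arrange_by_speaker(file_to_speaker, out_root, chapter_id="0"):
    """Copy audio files into <out_root>/<speaker>/<chapter>/ directories.

    file_to_speaker: dict mapping each audio file path to a speaker label.
    This mapping is hypothetical input; the dataset must supply it.
    """
    out_root = Path(out_root)
    counters = defaultdict(int)  # per-speaker utterance counter
    written = []
    # Sort by path so the utterance numbering is reproducible.
    for src, speaker in sorted(file_to_speaker.items(), key=lambda kv: str(kv[0])):
        src = Path(src)
        dest_dir = out_root / str(speaker) / chapter_id
        dest_dir.mkdir(parents=True, exist_ok=True)
        # LibriSpeech-style file name: speaker-chapter-utterance
        name = f"{speaker}-{chapter_id}-{counters[speaker]:04d}{src.suffix}"
        counters[speaker] += 1
        dest = dest_dir / name
        shutil.copy2(src, dest)  # copy, so the original files stay untouched
        written.append(dest)
    return written
```

Calling `arrange_by_speaker({"raw/a.wav": "spk1", "raw/b.wav": "spk2"}, "dataset")` would produce one directory per speaker under `dataset/`. If no such mapping can be built, the files cannot be grouped this way.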

@fancat-programer
Author

Unfortunately, no... I will try to train the encoder on LibriSpeech, LibriTTS, VoxForge (1, 2), and Mozilla Common Voice (Russian, Belarusian, English).

@fancat-programer
Author

Should I train the model on more than two languages? Will it affect the quality?

@ghost

ghost commented Oct 6, 2021

If the symbols in your languages are mutually exclusive, it should be possible. However, I wouldn't recommend it until you have a lot of experience with training models.
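
One way to check the "mutually exclusive" condition is to compare the symbol sets directly. This is only a sketch: it assumes "symbols" means the text characters used in each language's transcripts, which the reply does not spell out, and the helper name `shared_symbols` and the lowercase alphabets are illustrative.

```python
from itertools import combinations


def shared_symbols(alphabets):
    """Return {(lang_a, lang_b): shared symbols} for every pair that overlaps."""
    overlaps = {}
    for (a, sym_a), (b, sym_b) in combinations(sorted(alphabets.items()), 2):
        common = set(sym_a) & set(sym_b)
        if common:
            overlaps[(a, b)] = common
    return overlaps


# Illustrative lowercase alphabets; a real check should use the
# characters that actually occur in your transcripts.
alphabets = {
    "en": "abcdefghijklmnopqrstuvwxyz",
    "ru": "абвгдеёжзийклмнопрстуфхцчшщъыьэюя",
    "be": "абвгдеёжзійклмнопрстуўфхцчшыьэюя",
}
overlaps = shared_symbols(alphabets)
print(sorted(overlaps))
```

English does not share letters with either Cyrillic alphabet, while Russian and Belarusian share most of theirs, so that pair would not meet the condition as stated.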

@ghost ghost closed this as completed Oct 6, 2021
@fancat-programer
Author

> If the symbols in your languages are mutually exclusive, it should be possible. However, I wouldn't recommend it until you have a lot of experience with training models.

By "model" I mean the encoder.

@ghost

ghost commented Oct 6, 2021

It was attempted in #126. Although the encoder model performed very well for speaker verification, the results for voice cloning were not good in that one instance.

This issue was closed.