Why the generated voice sounds so unreal? #564

Closed
chris795 opened this issue Oct 16, 2020 · 1 comment

Comments

chris795 commented Oct 16, 2020

I tried this on a cloud GPU, using a 20-minute recording of my own voice as the sample, but the generated voice sounds very different from my real voice.

ghost commented Oct 16, 2020

Please read through #411 for some clues as to why this is the case. You can also read #41 and #364. Some voices clone better than others, depending on how well the speaker encoder represents them and on the training data used.

If you want better similarity with your own voice, you can generate your own training data and finetune a single-speaker model. See #437 for information on how to do it (no support available).
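
To make "similarity" a bit more concrete: one common way to compare two voices is the cosine similarity between their speaker embeddings. The snippet below is only a rough sketch of that idea, not code from this repository. The function name `cosine_similarity` and the three-element vectors are made up for illustration; real speaker embeddings have many more dimensions and would come from the speaker encoder.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine similarity of two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Hypothetical toy embeddings, NOT real encoder output.
reference = [0.9, 0.1, 0.4]   # embedding of the real voice
cloned = [0.7, 0.3, 0.5]      # embedding of the generated voice
unrelated = [-0.2, 0.9, -0.1]  # embedding of a different speaker

print("reference vs cloned:   ", cosine_similarity(reference, cloned))
print("reference vs unrelated:", cosine_similarity(reference, unrelated))
```

A value closer to 1 means the two embeddings point in nearly the same direction. Embedding similarity is only a proxy, though, and a high score does not guarantee that the output sounds like you.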

@ghost ghost closed this as completed Oct 16, 2020