You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Please read through #411 for some clues to why this is the case. Can also read #41 and #364. Some voices will clone better than others due to their representation by the speaker encoder and the training data used.
If you want better similarity with your own voice, you can generate your own training data and finetune a single-speaker model. See #437 for information on how to do it (no support available).
I try this on my GPU clound and use my own voice (20 minutes length) as a sample, but the generated voice sound too far from my real voice.
The text was updated successfully, but these errors were encountered: