-
Notifications
You must be signed in to change notification settings - Fork 8.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
poor performance in compare to the main paper? #411
Comments
I would recommend reading #41 and the first few posts of #126 for some context. The main points as I understand them:
|
Don't forget this too:
|
At Resemble.AI we also have better results by using a new vocoder that my colleague @fatchord developed. I believe he's about to publish the paper he wrote about it. |
We can reduce artifacts in the vocoder with additional training (#126 (comment)) . However, it does not make a perceptible difference in the cloned voice. This result also suggests that to the extent the vocoder has an impact on the output quality, we are reaching the limits of what is possible with WaveRNN. |
Absolutely astounding what you're all doing at Resemble, as well. Saw the LTT videos done in cooperation with you lot as well; was very happy to see some publicity in front of the average tech nerd. |
Yeah and the LTT video is using models dating from january, our sound quality has way improved since |
Hi,
For those of you who are working with this repo to synthesize different voices;
Have you noticed a huge difference between the voices generated by this repo and the samples released by the main paper here?
If yes, let's discuss and find out the reason(s).
The text was updated successfully, but these errors were encountered: