Feedback: My experience using this (very impressive) project #360
Comments
.. by the way @CorentinJ, I'd be happy to attribute your project or resemble.ai, as you prefer, in the skill description. It's non-commercial (i.e. free).
Skill store links for those who are interested in the output: US store: https://www.amazon.com/dp/B08B59XJLY
I've posted about it in a few Reddit forums and referenced this project.
Thanks for sharing @plummet555 ! I have a question and a suggestion:
Can you please share the changes? This would help me for #384 where I am trying to make the output more consistent.
You can try this vocoder model with additional training: #126 (comment). The speech quality is nearly identical, but I find it cuts down on these types of artifacts. If you try it out, I'd like to know if it worked for you.
Closing this issue due to inactivity; feel free to reopen. I would appreciate an answer on how to set the tacotron2 hardcoded seeds for better repeatability.
Hi @blue-fish - sorry I forgot to reply earlier. I've just shared my repo with you. I didn't make it public as it is a bit messy, but hopefully it will help you. Look at the changes to tacotron2 in the 'silence detection and seeding' commit.
Thank you @plummet555! I found the changes you were describing. I'll add a note to #384. If you didn't notice my comment about the updated vocoder model above, you can try plugging that in (no change to hparams needed) and see if the audio quality gets better. I've noticed fewer artifacts but no difference in voice. #126 (comment)
Not an issue - just thought my experiences may be of use to some people using this project.
I used it to build an Alexa skill that will read out Trump's latest tweet using his simulated voice. It's currently in certification - will be called 'Robo Trump AI Tweets' when it is published.
I think the results are pretty good. I had to do a few things though: