Hello, and thank you for this great work @CorentinJ
I have a few observations below. I wanted to bring them to your notice and ask for your advice on fixing them.
1. For shorter input text (under about 25 characters), the generated audio has gaps or noise between words. This happens consistently. Is there any way to prevent it?
2. For longer input text (between 25 and 120 characters), words are skipped. This is not consistent, but it happens quite frequently. Is there a reason for this, and is there any way to prevent it?
3. For very long input text (more than 120 characters, with multiple sentences), the generated output starts at a normal speed and then speeds up until it becomes difficult to comprehend. Is there a solution to this?
I am using the pre-trained models you have published in this repo.
I would appreciate it if you could share your thoughts and advice.
Thanks in advance!
As a workaround, the "enhance vocoder output" option in the toolbox will also use voice activity detection to trim out these gaps if you have the webrtcvad package installed.
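For readers who want a feel for what trimming gaps does, here is a rough sketch. It is not the toolbox's code and it does not use webrtcvad: it swaps in a plain per-frame energy threshold as a stand-in for a real voice activity detector. The function name `trim_long_silences` and all parameter values (30 ms frames, the 0.01 RMS threshold, keeping at most 6 silent frames in a row) are assumptions chosen for illustration only.

```python
import math


def trim_long_silences(samples, sample_rate=16000, frame_ms=30,
                       threshold=0.01, max_silence_frames=6):
    """Shorten long quiet stretches in a mono float waveform.

    Hypothetical helper: an energy threshold stands in for a real
    voice activity detector such as webrtcvad.
    """
    frame_len = int(sample_rate * frame_ms / 1000)
    kept = []
    silent_run = 0  # number of consecutive quiet frames seen so far
    for start in range(0, len(samples), frame_len):
        frame = samples[start:start + frame_len]
        rms = math.sqrt(sum(x * x for x in frame) / len(frame))
        if rms >= threshold:
            # Loud enough to count as speech: keep it and reset the run.
            silent_run = 0
            kept.extend(frame)
        else:
            # Quiet frame: keep only the first few of each run so that
            # short natural pauses survive and long gaps are shortened.
            silent_run += 1
            if silent_run <= max_silence_frames:
                kept.extend(frame)
    return kept


# Synthetic check: 10 frames of tone, 30 frames of silence, 10 frames of tone.
SAMPLE_RATE = 16000
FRAME_LEN = 480  # 30 ms at 16 kHz
tone = [0.5 * math.sin(2 * math.pi * 220 * n / SAMPLE_RATE)
        for n in range(10 * FRAME_LEN)]
gap = [0.0] * (30 * FRAME_LEN)
signal = tone + gap + tone

trimmed = trim_long_silences(signal, sample_rate=SAMPLE_RATE)
print(len(signal), "->", len(trimmed))  # 24000 -> 12480
```

An energy threshold is much cruder than a trained detector: it will treat quiet speech as silence and loud noise as speech, so it only illustrates the idea of capping the length of each gap.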
I have not seen this before. Can you try to find an input sequence and random seed that reproduce it, and provide the source audio file used for the embed?
@kkprabhu Closing as duplicate of #411. You are welcome to reopen this issue if you have audio samples and a reproducible test case for the number 2 item (skipped words).