Issues with outputs of short and long texts #459

kkprabhu · 2020-07-30T14:05:46Z

Hello, and thank you for this great work @CorentinJ
I have below few observations. Wanted to bring them to your notice and seek your advice in fixing them.

For a shorter input text (~<25 characters), the generated audio has gaps/noise in between words. Is there any way to prevent it(happens consistently)?
For longer input text(between 25 to 120 characters), the words are skipped in between. This is not consistent but happens quite frequently. Any reason for this and any way to prevent this?
For very long input text(>120 characters with multiple sentences), the generated output starts with a normal speed and then the speed increases to an extent it becomes difficult to comprehend. Any solution to this?

I am using the pre-trained models you have published in this repo.
Appreciate if you can share your thoughts and advice.
Thanks in advance!

ghost · 2020-07-30T14:13:11Z

See Fixing the synthesizer's gaps in spectrograms #53. It gets a little better with a LibriTTS-trained model: Training a new model based on LibriTTS #449 (comment) .
- As a workaround, the "enhance vocoder output" option in the toolbox will also use voice activation detection to trim out these gaps if you have the webrtcvad package installed.
Have not seen this before, can you try to find an input sequence + random seed that does this? And provide the source audio file for the embed.
Issue was also reported in Re: speed or rate of talking - generated audio speaking way too fast #347 (don't have an explanation yet)

Also see #411 for discussion about some things that should be changed or improved.

ghost · 2020-08-07T07:42:32Z

@kkprabhu Closing as duplicate of #411. You are welcome to reopen this issue if you have audio samples and a reproducible test case for the number 2 item (skipped words).

ghost closed this as completed Aug 7, 2020

This issue was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issues with outputs of short and long texts #459

Issues with outputs of short and long texts #459

kkprabhu commented Jul 30, 2020

ghost commented Jul 30, 2020 •

edited by ghost

Loading

ghost commented Aug 7, 2020

Issues with outputs of short and long texts #459

Issues with outputs of short and long texts #459

Comments

kkprabhu commented Jul 30, 2020

ghost commented Jul 30, 2020 • edited by ghost Loading

ghost commented Aug 7, 2020

ghost commented Jul 30, 2020 •

edited by ghost

Loading