Generated Samples are noisy #37

Open
SandyPanda-MLDL opened this issue May 15, 2024 · 1 comment
SandyPanda-MLDL commented May 15, 2024

I used the pretrained model provided in the official repo's Google Drive. When I ran inference.py with that checkpoint, the generated samples were very noisy for every number of reverse diffusion steps I tried (10, 20, 30, 40, 50, 70). I used the Libri-TTS checkpoint, not the LJ-Speech one. The samples on the demo page, however, are of good quality. Any suggestions would be appreciated.

@li1jkdaw

Hi! You can check this issue. In brief: the multi-speaker checkpoint trained on LibriTTS is provided only to show that GradTTS can work in a multi-speaker setting, and its quality may not be good for an arbitrary speaker. All the results in the paper were obtained in the single-speaker setting (LJ-Speech checkpoint).
