-
Notifications
You must be signed in to change notification settings - Fork 4.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
v0.1.3 #674
Conversation
Change default value of `"mixed_precision" : false` as when it is set true it leads to `raise RuntimeError(f" [!] NaN loss with {key}.") RuntimeError: [!] NaN loss with decoder_loss.`
Forcing do_trim_silence to False in the extract TTS script
Bug fix on train encoder
`mixed_precision` set to false
Change to _get_preprocessor_by_name
…contributing added information to ask for model contributions
Fix test runs and wavegrad test_run
Compute speaker embeddings in batch for the LSTM Speaker Encoder and Compute embeddings/ finding chars using config file.
Fix stopnet training for Tacotron models
I installed this version to run some tests:
|
Hi @mbarnig . |
@thorstenMueller : I know, I installed the dev branch. I think the purpose of the present comment #674 is to get some early reactions before the release of the new version. |
@mbarnig make sure you followed installation instructions. https://tts.readthedocs.io/en/latest/installation.html |
@erogol : my fault. I was using the ancient command With
the script |
🐸 v0.1.3
🐞Bug Fixes
Fix Tacotron stopnet training
Models trained after v0.1 had the problem that the stopnet was not trained. It caused models not to generate audio
at evaluation and inference time.
Fix
test_run
at training. (👑 @WeberJulian)In training 🐸 TTS would skip the
test_run
and not generate test audio samples. Now it is fixed :).💾 Code updates
compute_embeddings.py
for efficiency and compatibility with the latest speaker encoder. (👑 @Edresson)🚀 Model releases