v0.1.3 #674

erogol · 2021-07-24T10:27:08Z

🐸 v0.1.3

🐞Bug Fixes

Fix Tacotron stopnet training

Models trained after v0.1 had the problem that the stopnet was not trained. It caused models not to generate audio
at evaluation and inference time.
Fix test_run at training. (👑 @WeberJulian)

In training 🐸 TTS would skip the test_run and not generate test audio samples. Now it is fixed :).

💾 Code updates

Refactoring in compute_embeddings.py for efficiency and compatibility with the latest speaker encoder. (👑 @Edresson)

🚀 Model releases

New Fullband-MelGAN model for Throsten German dataset. (👑 @thorstenMueller)

Change default value of `"mixed_precision" : false` as when it is set true it leads to `raise RuntimeError(f" [!] NaN loss with {key}.") RuntimeError: [!] NaN loss with decoder_loss.`

Forcing do_trim_silence to False in the extract TTS script

Bug fix on train encoder

`mixed_precision` set to false

Change to _get_preprocessor_by_name

…contributing added information to ask for model contributions

Fix test runs and wavegrad test_run

Compute speaker embeddings in batch for the LSTM Speaker Encoder and Compute embeddings/ finding chars using config file.

Fix stopnet training for Tacotron models

mbarnig · 2021-07-24T16:08:23Z

I installed this version to run some tests:

Traceback (most recent call last):
  File "TTS/server/server.py", line 10, in <module>
    from flask import Flask, render_template, request, send_file
ModuleNotFoundError: No module named 'flask'

thorstenMueller · 2021-07-24T16:21:55Z

Hi @mbarnig .
According to https://github.com/coqui-ai/TTS/releases the new version has not been released yet.

mbarnig · 2021-07-24T17:06:06Z

@thorstenMueller : I know, I installed the dev branch. I think the purpose of the present comment #674 is to get some early reactions before the release of the new version.

erogol · 2021-07-25T09:28:39Z

@mbarnig make sure you followed installation instructions. https://tts.readthedocs.io/en/latest/installation.html

mbarnig · 2021-07-25T11:09:10Z

@erogol : my fault. I was using the ancient command pip install -e . to install from source.

With

make system-deps
make install

the script python3 TTS/server/server.py works without error.
In the future I will always check if the documentation has been changed before raising an issue.
Thanks for your quick response.

Edresson and others added 30 commits June 5, 2021 03:12

Create a batch for more fast inference on LSTM Speaker Encoder

14b209c

mixed_precision set to false

d0cef71

Change default value of `"mixed_precision" : false` as when it is set true it leads to `raise RuntimeError(f" [!] NaN loss with {key}.") RuntimeError: [!] NaN loss with decoder_loss.`

Forcing do_trim_silence to False in the extract TTS script

b0aa189

Fix #571

d85ee90

Merge branch 'dev' into fix-extract-tts-script

121d5cf

Merge pull request #572 from a-froghyar/fix-extract-tts-script

79241b6

Forcing do_trim_silence to False in the extract TTS script

Compute embeddings and find characters using config file

b74b510

Merge branch 'dev' into dev

e78e3cd

fix Lint checks

8364405

fix Lint checks

28bec23

fix Lint checks

99d40e9

Merge branch 'dev' into dev

eb84bb2

use speaker manager on compute embeddings script

1c4e806

Change to _get_preprocessor_by_name

6e3e6d5

bug fix on train_encoder and unit tests

4eac1c4

Merge pull request #642 from Edresson/main-trainer

d4b1c17

Bug fix on train encoder

Merge pull request #562 from Sadam1195/patch-1

639b02f

`mixed_precision` set to false

Merge pull request #628 from Aloento/patch-2

93a74cb

Change to _get_preprocessor_by_name

Merge fix and eval split as argparse

2e5baff

lint fix and eval as argparse in extract tts spectrograms

d906fea

Fix test sentences synthesis

32974dd

Fix tests

7d92b30

refix linter

c79a82e

added information to ask for model contributions

98d88b8

Merge pull request #664 from ravi-maithrey/adding_call_for_models_to_…

162696d

…contributing added information to ask for model contributions

remove ignore generate eval flag

b1620d1

Changes for review

25832eb

Fix WaveGrad test_run

58cc414

Fix linter issues

05c75aa

Merge pull request #667 from coqui-ai/fix-test-sentences

9bb7f31

Fix test runs and wavegrad test_run

Edresson and others added 8 commits July 21, 2021 07:16

Add docstring to compute_embeddings script

d5adc35

Merge pull request #581 from Edresson/dev

30eed34

Compute speaker embeddings in batch for the LSTM Speaker Encoder and Compute embeddings/ finding chars using config file.

Fix #618

e1f0ea5

Update dataset URL

377b379

added information to ask for model contributions

d7a9965

Fix stopnet training

fc0c460

Update max_decoder_steps in tacotron recipes

d435fd7

Merge pull request #673 from coqui-ai/fix_stopnet

75b201c

Fix stopnet training for Tacotron models

erogol added the 🚀 new version label Jul 24, 2021

erogol added 3 commits July 26, 2021 15:38

Fix server.py for multi-speaker models

764f684

Add fullband-melgan DE vocoder

4b7b88d

Update default vocoder for de-thorsten

febd610

erogol merged commit d0292dd into main Jul 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.1.3 #674

v0.1.3 #674

erogol commented Jul 24, 2021 •

edited

Loading

mbarnig commented Jul 24, 2021

thorstenMueller commented Jul 24, 2021

mbarnig commented Jul 24, 2021

erogol commented Jul 25, 2021

mbarnig commented Jul 25, 2021

v0.1.3 #674

v0.1.3 #674

Conversation

erogol commented Jul 24, 2021 • edited Loading

🐸 v0.1.3

🐞Bug Fixes

💾 Code updates

🚀 Model releases

mbarnig commented Jul 24, 2021

thorstenMueller commented Jul 24, 2021

mbarnig commented Jul 24, 2021

erogol commented Jul 25, 2021

mbarnig commented Jul 25, 2021

erogol commented Jul 24, 2021 •

edited

Loading