Skip to content

Commit 5ab2601

Browse files
committed
update readme for aishell3 hifigan, test=tts
1 parent c4035f8 commit 5ab2601

File tree

4 files changed

+29
-5
lines changed

4 files changed

+29
-5
lines changed

docs/source/released_model.md

+2-1
Original file line numberDiff line numberDiff line change
@@ -49,11 +49,12 @@ Model Type | Dataset| Example Link | Pretrained Models| Static Models|Size (stat
4949
WaveFlow| LJSpeech |[waveflow-ljspeech](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/ljspeech/voc0)|[waveflow_ljspeech_ckpt_0.3.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/waveflow/waveflow_ljspeech_ckpt_0.3.zip)|||
5050
Parallel WaveGAN| CSMSC |[PWGAN-csmsc](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/csmsc/voc1)|[pwg_baker_ckpt_0.4.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwg_baker_ckpt_0.4.zip)|[pwg_baker_static_0.4.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwg_baker_static_0.4.zip)|5.1MB|
5151
Parallel WaveGAN| LJSpeech |[PWGAN-ljspeech](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/ljspeech/voc1)|[pwg_ljspeech_ckpt_0.5.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwg_ljspeech_ckpt_0.5.zip)|||
52-
Parallel WaveGAN|AISHELL-3 |[PWGAN-aishell3](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/aishell3/voc1)|[pwg_aishell3_ckpt_0.5.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwg_aishell3_ckpt_0.5.zip)|||
52+
Parallel WaveGAN| AISHELL-3 |[PWGAN-aishell3](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/aishell3/voc1)|[pwg_aishell3_ckpt_0.5.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwg_aishell3_ckpt_0.5.zip)|||
5353
Parallel WaveGAN| VCTK |[PWGAN-vctk](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/vctk/voc1)|[pwg_vctk_ckpt_0.5.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwg_vctk_ckpt_0.5.zip)|||
5454
|Multi Band MelGAN | CSMSC |[MB MelGAN-csmsc](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/csmsc/voc3) | [mb_melgan_csmsc_ckpt_0.1.1.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/mb_melgan/mb_melgan_csmsc_ckpt_0.1.1.zip) <br>[mb_melgan_baker_finetune_ckpt_0.5.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/mb_melgan/mb_melgan_baker_finetune_ckpt_0.5.zip)|[mb_melgan_csmsc_static_0.1.1.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/mb_melgan/mb_melgan_csmsc_static_0.1.1.zip) |8.2MB|
5555
Style MelGAN | CSMSC |[Style MelGAN-csmsc](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/csmsc/voc4)|[style_melgan_csmsc_ckpt_0.1.1.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/style_melgan/style_melgan_csmsc_ckpt_0.1.1.zip)| | |
5656
HiFiGAN | CSMSC |[HiFiGAN-csmsc](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/csmsc/voc5)|[hifigan_csmsc_ckpt_0.1.1.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_csmsc_ckpt_0.1.1.zip)|[hifigan_csmsc_static_0.1.1.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_csmsc_static_0.1.1.zip)|50MB|
57+
HiFiGAN | AISHELL-3 |[HiFiGAN-aishell3](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/aishell3/voc5)|[hifigan_aishell3_ckpt_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_aishell3_ckpt_0.2.0.zip)|||
5758
WaveRNN | CSMSC |[WaveRNN-csmsc](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/csmsc/voc6)|[wavernn_csmsc_ckpt_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/wavernn/wavernn_csmsc_ckpt_0.2.0.zip)|[wavernn_csmsc_static_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/wavernn/wavernn_csmsc_static_0.2.0.zip)|18MB|
5859

5960

examples/aishell3/voc5/README.md

+15-1
Original file line numberDiff line numberDiff line change
@@ -135,8 +135,22 @@ optional arguments:
135135
3. `--test-metadata` is the metadata of the test dataset. Use the `metadata.jsonl` in the `dev/norm` subfolder from the processed directory.
136136
4. `--output-dir` is the directory to save the synthesized audio files.
137137
5. `--ngpu` is the number of gpus to use, if ngpu == 0, use cpu.
138-
139138
## Pretrained Models
139+
The pretrained model can be downloaded here [hifigan_aishell3_ckpt_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_aishell3_ckpt_0.2.0.zip).
140+
141+
142+
Model | Step | eval/generator_loss | eval/mel_loss| eval/feature_matching_loss
143+
:-------------:| :------------:| :-----: | :-----: | :--------:
144+
default| 1(gpu) x 2500000|24.060|0.1068|7.499
145+
146+
HiFiGAN checkpoint contains files listed below.
147+
148+
```text
149+
hifigan_aishell3_ckpt_0.2.0
150+
├── default.yaml # default config used to train hifigan
151+
├── feats_stats.npy # statistics used to normalize spectrogram when training hifigan
152+
└── snapshot_iter_2500000.pdz # generator parameters of hifigan
153+
```
140154

141155
## Acknowledgement
142156
We adapted some code from https://github.com/kan-bayashi/ParallelWaveGAN.

paddlespeech/t2s/exps/synthesize.py

+1
Original file line numberDiff line numberDiff line change
@@ -156,6 +156,7 @@ def parse_args():
156156
choices=[
157157
'pwgan_csmsc', 'pwgan_ljspeech', 'pwgan_aishell3', 'pwgan_vctk',
158158
'mb_melgan_csmsc', 'wavernn_csmsc', 'hifigan_csmsc',
159+
'hifigan_ljspeech', 'hifigan_aishell3', 'hifigan_vctk',
159160
'style_melgan_csmsc'
160161
],
161162
help='Choose vocoder type of tts task.')

paddlespeech/t2s/exps/synthesize_e2e.py

+11-3
Original file line numberDiff line numberDiff line change
@@ -180,9 +180,17 @@ def parse_args():
180180
type=str,
181181
default='pwgan_csmsc',
182182
choices=[
183-
'pwgan_csmsc', 'pwgan_ljspeech', 'pwgan_aishell3', 'pwgan_vctk',
184-
'mb_melgan_csmsc', 'style_melgan_csmsc', 'hifigan_csmsc',
185-
'wavernn_csmsc'
183+
'pwgan_csmsc',
184+
'pwgan_ljspeech',
185+
'pwgan_aishell3',
186+
'pwgan_vctk',
187+
'mb_melgan_csmsc',
188+
'style_melgan_csmsc',
189+
'hifigan_csmsc',
190+
'hifigan_ljspeech',
191+
'hifigan_aishell3',
192+
'hifigan_vctk',
193+
'wavernn_csmsc',
186194
],
187195
help='Choose vocoder type of tts task.')
188196
parser.add_argument(

0 commit comments

Comments
 (0)