[Help] Share your TTS models #380

erogol · 2021-03-15T10:48:45Z

Please consider sharing your pre-trained models in any language (If the licences allow that).

We can include them in our model catalogue for public use by attributing your name (website, company etc.).

That would enable more people to experiment together and coordinate, instead of individual efforts to achieve similar goals.

That is also a chance to make your work more visible.

You can share in two ways;

Share the model files with us and we serve them with the next 🐸 TTS release.
Upload your models on GDrive and share the link.

Models are served under .models.json file and any model is available under tts CLI or Server end points. More details...

(previously mozilla/TTS#395)

The text was updated successfully, but these errors were encountered:

enjikaka · 2021-04-15T08:51:34Z

Any ELI5 tutorial/doc for creating a dataset for your own language/dialect?

erogol · 2021-04-15T14:32:25Z

Not sure if it is ELI5, but there is this link https://github.com/coqui-ai/TTS/wiki/What-makes-a-good-TTS-dataset

Also, @thorstenMueller has created a TTS dataset from the gecko so he might have valuable comments if you have specific questions.

thorstenMueller · 2021-04-15T20:09:22Z

Feel free to ask specific question. I'd happy to share my experiences on recording a new dataset here.

Find/Create a text corpus to record (one sentence = 1 recording)
Replace numbers to text
Create csv file from corpus
Check Mimic-Recording-Studio from Mycroft as recording environment (https://github.com/MycroftAI/mimic-recording-studio)
Start recording
- Constant speed while recordings
- Speak all chars clearly
- Speak in neutral voice
- Use good microphone equipment
- Find a recording place without random noise

Sadam1195 · 2021-04-21T12:05:04Z

Hi @erogol , thank you for the amazing work, from Mozilla TTS to coqui-ai. Although Mozilla seemed perfect to me as it had wider community reach, just hope this grows even wider and faster than Mozilla. I am planning to share my models for Spanish and Italian using (Taco2 600k steps + WaveRNN). Audio quality seems to be good but I need to train it a bit more and also ask dataset providers if that would be okay if I make the models public.
Fingers crossed.

Let me know if I can contribute in any way I have Google Colab Pro resources laying around free.

+-----------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=============================================================================|
| No running processes found |
+-----------------------------------------------------------------------------+

erogol · 2021-04-21T12:15:22Z

@Sadam1195 thx for the amazing work 🚀🚀.

I really hope we can include your models, of course with the right attribution going to you.

Just waiting for your signal.

For general contribution, this is a nice place to start https://github.com/coqui-ai/TTS/blob/main/CONTRIBUTING.md

If you just like to train models, let me know we can also find new datasets to attack.

Sadam1195 · 2021-04-21T13:50:26Z

I really hope we can include your models, of course with the right attribution going to you.
I hope they allow me, otherwise I would see it as wasting my time and effort.
Just waiting for your signal.
I will let you know when I get the confirmation.
If you just like to train models, let me know we can also find new datasets to attack.
Training models on colab can be a bit annoying as sessions often get disconnected even with all the tricks in the book.

Nonetheless, I would love to train model on new datasets (if you have any) specially in the languages in which TTS models haven't been made public yet.

kaiidams · 2021-05-16T10:59:37Z

Hello,

I've just started to train a public domain Japanese dataset https://github.com/kaiidams/Kokoro-Speech-Dataset with Tacotron 2 of the latest master of https://github.com/mozilla/TTS on Google Colab Free. After 19K steps, I can hear what he says, although it is metallic.

To proceed, I'd like to know which branch and repo do you recommend for me to use? https://github.com/erogol/TTS_recipes seems a bit old.

Sadam1195 · 2021-05-16T11:40:18Z

To proceed, I'd like to know which branch and repo do you recommend for me to use? https://github.com/erogol/TTS_recipes seems a bit old.

Please use this https://github.com/coqui-ai/TTS instead of https://github.com/mozilla/TTS and use the latest main branch. @kaiidams

kaiidams · 2021-05-21T06:49:56Z

@Sadam1195 @erogol

I trained Tacotron 2 for 130K steps with this code https://github.com/kaiidams/TTS/tree/kaiidams/kokoro which was forked from the latest main.
https://drive.google.com/drive/folders/1-1_HB-ogmvD-qYaHm8D5Xp1pWq9HKhB_?usp=sharing
The included sample.wav was generated with vocoder_models/universal/libri-tts/wavegrad.

The input of the model is Romanized Japanese text. It requires some dependencies like MeCab to convert texts from ordinary ones.
The dataset is the public domain and the reader knows about the dataset. I think I can provide Python code for text conversion.

erogol · 2021-05-21T10:06:34Z

@kaiidams if you can send a PR for text conversion something similar to the Chinese API we have, with the model, would be a great contribution.

zubairahmed-ai · 2021-06-09T05:38:58Z

Feel free to ask specific question. I'd happy to share my experiences on recording a new dataset here.

Find/Create a text corpus to record (one sentence = 1 recording)

Replace numbers to text

Create csv file from corpus

Check Mimic-Recording-Studio from Mycroft as recording environment (https://github.com/MycroftAI/mimic-recording-studio)

Start recording

Constant speed while recordings

Speak all chars clearly

Speak in neutral voice

Use good microphone equipment

Find a recording place without random noise

Any reason why this and this isn't in the readme?
I had to look up training to reach here

thorstenMueller · 2021-06-09T11:31:37Z

Hi @zubairahmed-ai.
Here's a talk a made on how to record a voice dataset if that's helpful for you.

https://youtu.be/m-Uwb-Bg144

zubairahmed-ai · 2021-06-09T11:33:47Z

@thorstenMueller Perfect timing, thank you

zubairahmed-ai · 2021-06-09T11:35:18Z

Oh just realized this talk happened during recent Google I/O and I somehow didn't catch it while watching other videos :)

zubairahmed-ai · 2021-06-10T05:35:17Z

@thorstenMueller Thanks so much for the great video explaining your process in details with some tips. I'll make sure I follow that, do you plan to give a try to other models besides Tacotron-2? like Align-TTS?

thorstenMueller · 2021-06-10T18:39:03Z

You're welcome @zubairahmed-ai :-).
I'm currently finishing some recording stuff for my emotional dataset and train a Fullband-MelGAN vocoder. So i've no time left to look at other models like Align-TTS. But feel free to train a "Thorsten" model with Align-TTS ;-).

stale · 2021-07-10T19:01:45Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look our discussion channels.

ravi-maithrey · 2021-07-13T17:39:08Z

Asking people to share their models can also be added to the CONTRIBUTING.md, since it is asking for contributions. I'd be up to doing that, if no one has taken it up yet?

erogol · 2021-07-14T11:53:16Z

yeah good point. Feel free to take on it.

ghost · 2021-09-21T08:53:43Z

I would like to contribute my own model.. but I stuck in middle.. I have created dataset(LJSpeech) of my own voice . For training my model I need config.json file , so can anyone provide me the template of config.json file for LJSpeech dataset format required to train my model.

Thanks in Advance

erogol · 2021-09-21T10:24:24Z

@ManoBharathi93 you can start from the LJSpeech recipes in the recipes folder and change the config fields for your dataset specs. You can find more info here https://tts.readthedocs.io/en/latest/

ghost · 2021-09-21T11:18:00Z

@erogol thanks a lot sir

ghost · 2021-09-22T03:41:33Z

Hello folks, How can I add drop-down Menu to list available models(downloaded models) in WEB-UI and when I change the server.py file the web interface is not changing ? please mention which file name want to make changes impact in WEB-UI..

godspirit00 · 2021-10-08T09:15:07Z

I'd like to share a Tacotron2-DCA model and a Univnet model I trained on the Nancy corpus.

Here is a sample:

sample.mp4

The link to the models:
https://drive.google.com/drive/folders/1bMNOjjYxcCkgwkcYAlsPR3qM4hZQzAOR?usp=sharing

Thanks again for the great work!

erogol · 2021-10-09T17:02:54Z

@godspirit00 the quality is awesome.

stale · 2021-11-08T18:09:43Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look our discussion channels.

erogol added the help wanted Contributions welcome!! label Mar 15, 2021

erogol pinned this issue Mar 15, 2021

coqui-ai deleted a comment from snakers4 Apr 2, 2021

stale bot added the wontfix This will not be worked on but feel free to help. label Jul 10, 2021

erogol removed the wontfix This will not be worked on but feel free to help. label Jul 11, 2021

ravi-maithrey mentioned this issue Jul 14, 2021

added information to ask for model contributions #664

Merged

stale bot added the wontfix This will not be worked on but feel free to help. label Aug 13, 2021

coqui-ai deleted a comment from stale bot Aug 15, 2021

stale bot removed the wontfix This will not be worked on but feel free to help. label Aug 15, 2021

stale bot added the wontfix This will not be worked on but feel free to help. label Sep 14, 2021

coqui-ai deleted a comment from stale bot Sep 15, 2021

stale bot removed the wontfix This will not be worked on but feel free to help. label Sep 15, 2021

stale bot added the wontfix This will not be worked on but feel free to help. label Nov 8, 2021

coqui-ai locked and limited conversation to collaborators Nov 10, 2021

erogol closed this as completed Nov 10, 2021

stale bot removed the wontfix This will not be worked on but feel free to help. label Nov 10, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

This issue was moved to a discussion.

[Help] Share your TTS models #380

[Help] Share your TTS models #380

erogol commented Mar 15, 2021 •

edited

Loading

enjikaka commented Apr 15, 2021

erogol commented Apr 15, 2021

thorstenMueller commented Apr 15, 2021

Sadam1195 commented Apr 21, 2021

erogol commented Apr 21, 2021

Sadam1195 commented Apr 21, 2021 •

edited

Loading

kaiidams commented May 16, 2021

Sadam1195 commented May 16, 2021 •

edited

Loading

kaiidams commented May 21, 2021 •

edited

Loading

erogol commented May 21, 2021 •

edited

Loading

zubairahmed-ai commented Jun 9, 2021

thorstenMueller commented Jun 9, 2021

zubairahmed-ai commented Jun 9, 2021

zubairahmed-ai commented Jun 9, 2021

zubairahmed-ai commented Jun 10, 2021

thorstenMueller commented Jun 10, 2021

stale bot commented Jul 10, 2021

ravi-maithrey commented Jul 13, 2021

erogol commented Jul 14, 2021

ghost commented Sep 21, 2021

erogol commented Sep 21, 2021

ghost commented Sep 21, 2021

ghost commented Sep 22, 2021

godspirit00 commented Oct 8, 2021 •

edited

Loading

erogol commented Oct 9, 2021

stale bot commented Nov 8, 2021

This issue was moved to a discussion.

This issue was moved to a discussion.

[Help] Share your TTS models #380

[Help] Share your TTS models #380

Comments

erogol commented Mar 15, 2021 • edited Loading

enjikaka commented Apr 15, 2021

erogol commented Apr 15, 2021

thorstenMueller commented Apr 15, 2021

Sadam1195 commented Apr 21, 2021

erogol commented Apr 21, 2021

Sadam1195 commented Apr 21, 2021 • edited Loading

kaiidams commented May 16, 2021

Sadam1195 commented May 16, 2021 • edited Loading

kaiidams commented May 21, 2021 • edited Loading

erogol commented May 21, 2021 • edited Loading

zubairahmed-ai commented Jun 9, 2021

thorstenMueller commented Jun 9, 2021

zubairahmed-ai commented Jun 9, 2021

zubairahmed-ai commented Jun 9, 2021

zubairahmed-ai commented Jun 10, 2021

thorstenMueller commented Jun 10, 2021

stale bot commented Jul 10, 2021

ravi-maithrey commented Jul 13, 2021

erogol commented Jul 14, 2021

ghost commented Sep 21, 2021

erogol commented Sep 21, 2021

ghost commented Sep 21, 2021

ghost commented Sep 22, 2021

godspirit00 commented Oct 8, 2021 • edited Loading

erogol commented Oct 9, 2021

stale bot commented Nov 8, 2021

This issue was moved to a discussion.

erogol commented Mar 15, 2021 •

edited

Loading

Sadam1195 commented Apr 21, 2021 •

edited

Loading

Sadam1195 commented May 16, 2021 •

edited

Loading

kaiidams commented May 21, 2021 •

edited

Loading

erogol commented May 21, 2021 •

edited

Loading

godspirit00 commented Oct 8, 2021 •

edited

Loading