Here is a model in Swedish #257
@ViktorAlm Could you make a Google Colab notebook of it? |
Can you share how you did it? I want to try training a Portuguese model. |
@ViktorAlm I tried this out but I am getting an error message when the synthesizer checkpoint is loaded.
What I did is replace the synthesizer and vocoder with your models. I did not make any synthesizer code changes for Swedish yet. Can you let me know if you changed any files or hparams to make this work? Error message including full traceback
|
@ViktorAlm Can you share your code with me? If needed, in a private repository so I can figure out exactly what changes are needed for your Swedish pretrained models to work? |
@ViktorAlm This is the difference between your model structure and the default. The input embedding batch size changed from 66 to 72. Do you know where that could be coming from? I diffed the synthesizer code of taco2swe and this repo, and can't find where that is coming from. Using the taco2swe version of the hparams doesn't seem to fix it either.
|
This is resolved @ViktorAlm ! Just needed to add the extra characters in the Swedish alphabet to
Edit: Fixed the sound issue, I had a bad encoder model in my folder. Once fixed it worked much better, see samples in following post. Thank you for sharing your models @ViktorAlm , I do think that others will find it useful for text to speech. |
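For anyone hitting the same 66 vs 72 mismatch, here is a minimal sketch of why the extra letters matter. The character set below is an assumption, chosen only so that it totals the 66 entries reported above; the names `build_symbols` and `text_to_ids` are hypothetical and not taken from either repository. The text embedding has one row per symbol, so a checkpoint trained with six extra Swedish letters expects 72 rows.

```python
# Hypothetical symbol table. The exact characters are an assumption picked
# to match the 66 entries mentioned in this thread, not a copy of any repo file.
PAD = "_"
EOS = "~"
BASE_CHARACTERS = "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz!'\"(),-.:;? "
SWEDISH_EXTRA = "ÅÄÖåäö"  # the six letters missing from the English set


def build_symbols(characters):
    """Return the full symbol list: padding, end-of-sequence, then characters."""
    return [PAD, EOS] + list(characters)


def text_to_ids(text, symbols):
    """Map text to symbol indices, silently dropping unknown characters."""
    lookup = {symbol: index for index, symbol in enumerate(symbols)}
    return [lookup[char] for char in text if char in lookup]


base_symbols = build_symbols(BASE_CHARACTERS)
swedish_symbols = build_symbols(BASE_CHARACTERS + SWEDISH_EXTRA)

print(len(base_symbols))     # 66 rows in the embedding table
print(len(swedish_symbols))  # 72 rows, which is what the Swedish checkpoint expects

# With the English-only set, "ö" and "å" are dropped from the input.
print(len(text_to_ids("smörgås", base_symbols)))     # 5
print(len(text_to_ids("smörgås", swedish_symbols)))  # 7
```

Appending the new letters at the end keeps the indices of the existing symbols unchanged, though whether the Swedish checkpoint used that exact ordering is something to verify against the taco2swe code.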
Here are some samples.zip, where I take VCTK p240 and p260 and clone an English utterance with Corentin's models, and a Swedish utterance with these models. In my opinion the Swedish performs better for this example, though they both leave something to be desired.
|
Hey! I haven't followed this. Yeah, the results were not that good, but I managed to get it to say a few words that were difficult to distinguish from the real voice. I tried it on more than 100 people and only one could guess all the cloned voices. If I did it again I would test something else, like Mellotron or something that came after it, or something transformer-based. In general I think you need a lot more data, and cleaner data, than what I was using. It mixes dialects left and right, so another voice embedding model trained on Swedish would probably be necessary. |
Hm, strange, I got this error: Could not find any synthesizer weights under synthesizer\saved_models\pretrained\taco_pretrained. Can someone help me? |
I got this error: Command "python setup.py egg_info" failed with error code 1 in C:\Users\dacia\AppData\Local\Temp\pip-install-1lqbf8bh\PyQt5\ Note that I run Win11. Could that be why I get this error? |
PS C:\Users\dacia\Desktop\Eudora\Real-Time-Voice-Cloning-400_pretrained_swe_301> python3 demo_toolbox.py |
Hm, it's gone. I can't download the required saved models for the Swedish version. :( Can someone please send them to my email [email protected]? When I go to this link https://github.com/blue-fish/Real-Time-Voice-Cloning/tree/400_pretrained_swe_301 it says not found. I remember there were important files there that I needed to get it to work. If someone could send them to me, that would be great. Does anyone have all the files so we can restore the page? |
All files needed to run the Swedish RTVC models from @ViktorAlm can be found here. |
What did you train your Swedish models on? I would love a pointer to some good datasets. I'm trying to replicate your Swedish TensorFlow models in PyTorch. @ViktorAlm |
Not sure how to share this, but feel free to add.
https://drive.google.com/drive/folders/1jrMWhkK3KrOKsbPMYtCCcPczPmf8V-eQ?usp=sharing
It uses the default English encoder. To adapt the Tacotron text pre-processing to Swedish, follow this code:
https://github.com/codemandosch/taco2swe
It is far from perfect but was a fun experiment.