-
Notifications
You must be signed in to change notification settings - Fork 104
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Is there any experiment on Chinese data set. #91
Comments
Does anyone know why that is? Or is there a Chinese data set with experimental success? What methods do you use to phoneme Chinese texts? |
I am sorry, I haven't trained a Chinese dataset, but I can assure that the model training is language independent. There are forks in Krygz https://github.com/UlutSoftLLC/MamtilTTS and Catalan https://huggingface.co/projecte-aina/matxa-tts-cat-multiaccent . So perhaps someone who has trained on a Chinese dataset can chip into the conversation. Just to confirm, did you see this page? https://github.com/shivammehta25/Matcha-TTS/wiki/Training-%F0%9F%8D%B5-Matcha%E2%80%90TTS-with-different-dataset-&-languages |
Hello author, thank you for your anwser !!! I trained the model on a chinese dataset AISHELL3 ,119 epochs, poor reception What do you think is the reason? how can i improve the synthesis ? |
I think the dataset size and training should be enough.
|
thank you foryour anwser ,shivammehta25。 |
So do you mean that using the Mainland Chinese version of Pinyin instead would not cause this problem? (I am also trying to use the method on Chinese dataset, I think this method is truly interesting) |
应该是都可以的,我之前的数据处理有问题, |
We have a Matcha-TTS recipe for the Chinese baker dataset. You can find the recipe at Note: We have provided a runtime for it. Please see sherpa-onnx We provide APIs in 12 programming languages for deploying MatchaTTS models on different platforms, e.g., Android, iOS, HarmonyOS, Raspberry Pi, etc. You can find a prebuilt Android APK for it at See also |
May I ask if there is any experiment on Chinese data set? Why I use pinyin as phoneme training on Chinese Mandarin data set, and what I synthesize is all noise?
The text was updated successfully, but these errors were encountered: