-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Character not found in vocabulary , is it normal ? #6
Comments
you probably have done something wrong,if you check here : https://www.kaggle.com/datasets/mobassir/comprehensive-bangla-tts you will see my trained log saved as .txt file and if you check my trainig log you won't see characters getting discarded, i have used this script to train vits with phoneme : https://github.com/mobassir94/comprehensive-bangla-tts/blob/main/bn_vits_tts/Bangla_phoneme_ViTS_trainer.ipynb |
Yes I have used exactly your dataset downloaded from kaggle .. Because I like to do it on your dataset first. Yes I might miss something. Let me try it from the beginning once again and yes I am using https://github.com/mobassir94/comprehensive-bangla-tts/blob/main/bn_vits_tts/Bangla_phoneme_ViTS_trainer.ipynb . Just changed the location parameter to run it . Still I might miss somewhere let me try it from the beginning once again . Thanks again for continuous help and support . |
Make sure this code prints absolute path of espeak and espeak-ng : At least you should see path of espeak-ng |
Yes espeak-ng having absolute path , and after rerun the code having the same issue . But as I can see #phonemizer="espeak-ng",# multi_phonemizer espeak-ng is commented out here .. if that is the case is there any other uses of espeak-ng in other module ? and aslo characters_config is commented out .. do you think that could cause this error ? |
#phonemizer="espeak-ng", is commented out because iirc by default phonemizer parameter selects espeak for bangla. you can uncomment and see if that was the case. |
Thanks for reply .. I have just used following characters_config and sample down to 1000 just to see the quick result and affter just 500 epoch I have started to get some good result . With just 1000 sample epoch is taking like 30 seoncds right now . So it's just a result of 4-5 hours of training .. I am still continuing the training just to see optimal outcome of 1000 samples. affer 500 epoch https://drive.google.com/file/d/1YsrikAe4vsAYCDO6eb-HQoN9L4ErRHz4/view?usp=drive_link and character_config I have used characters_config = CharactersConfig( so far so good .. let see what came out after 5000 epoch .. I will keep updating here and if I have any question in future. and Lastly thanks for all help. |
@desis123 good work. |
So you think it will not improve anymore ?? or should I continue at least 1000 ? |
@desis123 as you are not using phoneme currently,so you can train more and hopefully performance will be improved,,,i said "if you use phoneme then you don't have to train for many epochs" |
First of all thanks for getting a prompt reply for my previous question . I am sorry for disturbing you over and over . As I am pretty new to machine learning along with TTS . While I am running your training script at the beginning I am getting this following message
is it normal ?, I am not clear why characters not found is happening ?
second question after that script is running and currently on 256 epoch and each epoch is taking around 2 mins . I am running with 3090TI gpu , is there anyway to improve this timing ?
The text was updated successfully, but these errors were encountered: