Can't read Bengali year ১৯৫৪ সাল। কালো রাত। [Bug] #3815
Comments
@khandakershahi Use the pybangla normalizer for your number normalization.
@khandakershahi A Bengali phonemizer/normalizer is also included directly in Coqui TTS. You can use it as follows:
from TTS.tts.utils.text.phonemizers import BN_Phonemizer
bn = BN_Phonemizer()
bn.phonemize("১৯৫৪ সাল। কালো রাত।") (resulting in
@saifulislam79 Thank you. I am just a regular user and don't know Python coding or TTS. I tried, but I wasn't able to figure out how to use it with my main.py code. Would it be possible to give me an updated version of my main.py so that it works with your package? Or anything else that lets me keep using the GUI interface?
@eginhard Thank you. I wasn't able to figure out how to use it with my main.py code. Would it be possible to give me an updated version of my main.py, so that it works with the
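@khandakershahi Try this:
import os
import torch
from TTS.api import TTS
from TTS.tts.utils.text.phonemizers import BN_Phonemizer
import gradio as gr

# Use the GPU when one is available, otherwise fall back to the CPU.
device = "cuda" if torch.cuda.is_available() else "cpu"
bn = BN_Phonemizer()
tts = TTS(model_name='tts_models/bn/custom/vits-male').to(device)

def generate_audio(text="তুমি কেমন আছো?"):
    # Normalize the input (including Bengali digits) before synthesis.
    text = bn.phonemize(text)
    # Make sure the output directory exists before writing the file.
    os.makedirs("outputs", exist_ok=True)
    tts.tts_to_file(text=text, file_path="outputs/output.wav")
    return "outputs/output.wav"

# Simple Gradio GUI: one text box in, one audio player out.
demo = gr.Interface(
    fn=generate_audio,
    inputs=[gr.Text(label="Text"),],
    outputs=[gr.Audio(label="Audio"),],
)
demo.launch()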
@eginhard Thank you. It works now. One thing remains: it only reads ১৯৫৪ as a number, not as a year. Many, many thanks.
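One possible way to approach the year reading, as a rough sketch rather than anything Coqui TTS or pybangla provides: find four-digit Bengali numbers that are followed by the word সাল ("year") and split them into a hundreds part and a remainder, so that ১৯৫৪ can be spoken as "nineteen hundred fifty-four" instead of "one thousand nine hundred fifty-four". The function name `split_year_tokens` and the "followed by সাল" rule are assumptions made for illustration; turning the two parts into Bengali words is left to whatever normalizer is in use.

```python
import re

# Hypothetical helper: a four-digit run of Bengali digits followed by "সাল".
YEAR_PATTERN = re.compile(r"([০-৯]{4})\s*সাল")

def split_year_tokens(text):
    """Return (original, hundreds, remainder) for each year-like number."""
    results = []
    for match in YEAR_PATTERN.finditer(text):
        digits = match.group(1)
        # int() accepts Unicode decimal digits, including Bengali ones.
        hundreds, remainder = divmod(int(digits), 100)
        results.append((digits, hundreds, remainder))
    return results

print(split_year_tokens("১৯৫৪ সাল। কালো রাত।"))
```

The two integers could then be converted to words separately and joined before the text is passed to `bn.phonemize`.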
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look at our discussion channels.
Describe the bug
I was testing the Bengali voice model, and it fails to pronounce Bengali numerals. The Bengali digits and their Western equivalents are:
০ ১ ২ ৩ ৪ ৫ ৬ ৭ ৮ ৯
0 1 2 3 4 5 6 7 8 9
For example, ১৯৫৪ সাল। কালো রাত। should be read in Bengali as the year 1954.
Log:
[!] Character '৯' not found in the vocabulary. Discarding it.
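To make the digit mapping above concrete, here is a minimal sketch in plain Python (standard library only) that locates runs of Bengali digits in a sentence and converts them to ASCII digits. The helper names `find_bengali_numbers` and `bengali_digits_to_ascii` are made up for this example and are not part of Coqui TTS. It only illustrates which characters fall outside the model's vocabulary; it does not by itself make the model pronounce them.

```python
import re

# Bengali digits occupy the contiguous range U+09E6..U+09EF.
BN_DIGITS = "০১২৩৪৫৬৭৮৯"
BN_TO_ASCII = str.maketrans(BN_DIGITS, "0123456789")

def find_bengali_numbers(text):
    """Return every run of Bengali digits found in the text."""
    return re.findall(r"[০-৯]+", text)

def bengali_digits_to_ascii(text):
    """Replace each Bengali digit with its ASCII counterpart."""
    return text.translate(BN_TO_ASCII)

sentence = "১৯৫৪ সাল। কালো রাত।"
print(find_bengali_numbers(sentence))
print(bengali_digits_to_ascii(sentence))
```

A check like this can be run on input text before synthesis to spot digits that would otherwise be discarded.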
To Reproduce
pip install TTS
main.py
Expected behavior
No response
Logs
No response
Environment
Additional context
No response