[Hackathon 7th] fix Tacotron2 with CSMSC #3905
base: develop
Thanks for your contribution!
num_train = 9800
num_dev = 100
# split data into 3 sections, the max number of dev/test is 10% or 100
num_dev = min(int(len(wav_files) * 0.1), 100)
What is this doing?
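For context, the changed line appears to cap the dev set at 10% of the data or 100 utterances, whichever is smaller, instead of hard-coding the counts. A minimal sketch of that idea follows; the helper names `split_counts` and `split_files` are hypothetical, and giving the test set the same size as the dev set is an assumption read from the code comment, not something the diff shows.

```python
from typing import List, Tuple


def split_counts(num_files: int, ratio: float = 0.1, cap: int = 100) -> Tuple[int, int, int]:
    """Return (num_train, num_dev, num_test) for a dataset of num_files items."""
    # Dev size: 10% of the data, but never more than `cap` utterances.
    num_dev = min(int(num_files * ratio), cap)
    # Assumption: the test set uses the same size as the dev set.
    num_test = num_dev
    num_train = num_files - num_dev - num_test
    return num_train, num_dev, num_test


def split_files(wav_files: List[str]) -> Tuple[List[str], List[str], List[str]]:
    """Slice a sorted file list into train/dev/test sections."""
    num_train, num_dev, _ = split_counts(len(wav_files))
    train = wav_files[:num_train]
    dev = wav_files[num_train:num_train + num_dev]
    test = wav_files[num_train + num_dev:]
    return train, dev, test
```

With the full 10,000-utterance corpus this reproduces the previous hard-coded values (9800 train, 100 dev), while a smaller subset still yields a non-empty training set.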
examples/csmsc/tts0/README.md
Outdated
The structure of the folder is listed below.

```text
datasets/BZNSYP
```
datasets/BZNSYP should have three folders:
└─ Wave
   └─ .wav files (audio speech)
└─ PhoneLabeling
   └─ .interval files (alignment between phonemes and durations)
└─ ProsodyLabeling
   └─ 000001-010000.txt (text with prosody labels and pinyin)
Still, only the .wav files are used in training.
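A quick way to confirm the layout described above before preprocessing might look like the sketch below. The function name `check_bznsyp_layout` is hypothetical and not part of the repository; it only checks for the three folder names listed in this comment and collects the `.wav` files.

```python
from pathlib import Path

# Folder names taken from the layout described in the review comment.
EXPECTED_SUBDIRS = ("Wave", "PhoneLabeling", "ProsodyLabeling")


def check_bznsyp_layout(root):
    """Return (missing_subdirs, wav_files) for a BZNSYP dataset root."""
    root = Path(root)
    # Record every expected folder that is absent under the dataset root.
    missing = [name for name in EXPECTED_SUBDIRS if not (root / name).is_dir()]
    wave_dir = root / "Wave"
    # Only the .wav files are needed for training, so collect just those.
    wav_files = sorted(wave_dir.glob("*.wav")) if wave_dir.is_dir() else []
    return missing, wav_files
```

Calling `check_bznsyp_layout("datasets/BZNSYP")` returns an empty `missing` list when all three folders are present.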
This pull request is now in conflict :(
LGTM
PR types
Others
PR changes
Others
Describe