-
Notifications
You must be signed in to change notification settings - Fork 119
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Montreal Forced Alignment (MFA) Version Inquiry #39
Comments
Hi, @zeynabyousefi ! We used MFA v1.0. As for the English model, meta.yaml file states that it was version v0.9.0, architecture gmm+hmm, feats mfcc+deltas. |
Thanks . Additionally, I've encountered errors while running the get_avg_mels.ipynb file, which seem to be due to mismatches in sample rates, audio features (such as MFCC or Mel spectrogram), or other processing parameters. If specific settings are required for data preprocessing and input parameters, please provide detailed instructions. Thank you in advance for your assistance! |
@zeynabyousefi
I'd also want to mention that training DiffVC Average Voice Encoder on LJSpeech only is not a good idea unless you want to perform one-to-any voice conversion where source voice is always LJ. The main idea behind this Encoder is that it should convert any voice into some speaker-independent "average" voice preserving linguistic content of the source speech. It is supposed to be used in any-to-any voice conversion to transform any source voice to "average" voice thus helping to perform disentanglement between content and timbre. But if you train this Encoder only on some specific voice, it won't be able to perform properly on any voice, it will only work as expected on that particular voice. So, if you want to achieve any-to-any voice conversion, you'd better train the Encoder on as many different voices as possible. |
Hello, I would like to know the exact version of Montreal Forced Alignment (MFA) used in this project. I need to confirm the version to ensure compatibility with other project components.
@ytyeung
@wenyong-h
@ivanvovk
@huawei-noah-admin
The text was updated successfully, but these errors were encountered: