You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
we are trying to train wav2vec-u on Korean data but our GAN model does not converge well. Are the parameters used to train the model presented in the paper the same as those included in the repo? Also, are the parameters for the multilingual GAN the same as the English model?
Code
What have you tried?
We successfully replicated the Librispeech experiment with English data. We prepared a Korean wav2vec 2.0 model with the public AI hub Korean dataset, but the GAN model does not converge when trained on this same data.
( it is not decreasing until 150K)
We also tried training the GAN using the XLRS model as an encoder using English Librispeech 100 dataset, and it also does not converge well (with hyper parameter range in paper and recipe). The valididation weighted PPL does not fall below ~60.
( it is not decreasing until 150K)
What's your environment?
fairseq Version (e.g., 1.0 or main): 1.0.0a0+741fd13
@KimJeongSun Hello, I have the same problem with wav2vec-u experiment on the Mandarin data set. I guess it may be related to GAN's training hyperparameter settings, but I have adjusted the hyperparameters many times and have not achieved reasonable results. If you have new progress, please share it.
This issue has been automatically marked as stale. If this issue is still affecting you, please leave any comment (for example, "bump"), and we'll keep it open. We are sorry that we haven't been able to prioritize it yet. If you have any new additional information, please include it with your comment!
Closing this issue after a prolonged period of inactivity. If this issue is still present in the latest release, please create a new issue with up-to-date information. Thank you!
❓ Questions and Help
Before asking:
What is your question?
we are trying to train wav2vec-u on Korean data but our GAN model does not converge well. Are the parameters used to train the model presented in the paper the same as those included in the repo? Also, are the parameters for the multilingual GAN the same as the English model?
Code
What have you tried?
We successfully replicated the Librispeech experiment with English data. We prepared a Korean wav2vec 2.0 model with the public AI hub Korean dataset, but the GAN model does not converge when trained on this same data.
data:image/s3,"s3://crabby-images/4c9de/4c9de5f705bce40a19b6a787752d32aa2685c5b4" alt="image"
( it is not decreasing until 150K)
We also tried training the GAN using the XLRS model as an encoder using English Librispeech 100 dataset, and it also does not converge well (with hyper parameter range in paper and recipe). The valididation weighted PPL does not fall below ~60.
data:image/s3,"s3://crabby-images/ffdc9/ffdc90b29e4d47cff904fbf3f6800eb442643f13" alt="image"
data:image/s3,"s3://crabby-images/cc66d/cc66d0eb0fffb9262bd9ca07242e11005d3d50db" alt="image"
data:image/s3,"s3://crabby-images/035e4/035e43824f5485912a17c6bb5d82d995be0c708c" alt="image"
( it is not decreasing until 150K)
What's your environment?
pip
, source): git clone https://github.com/pytorch/fairseq.git && cd fairseq && git reset --hard 741fd13The text was updated successfully, but these errors were encountered: