Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

wav2vec-u GAN training not converging on Korean dataset #4077

Closed
KimJeongSun opened this issue Dec 15, 2021 · 4 comments
Closed

wav2vec-u GAN training not converging on Korean dataset #4077

KimJeongSun opened this issue Dec 15, 2021 · 4 comments

Comments

@KimJeongSun
Copy link

❓ Questions and Help

Before asking:

  1. search the issues.
  2. search the docs.

What is your question?

we are trying to train wav2vec-u on Korean data but our GAN model does not converge well. Are the parameters used to train the model presented in the paper the same as those included in the repo? Also, are the parameters for the multilingual GAN the same as the English model?

Code

What have you tried?

We successfully replicated the Librispeech experiment with English data. We prepared a Korean wav2vec 2.0 model with the public AI hub Korean dataset, but the GAN model does not converge when trained on this same data.
( it is not decreasing until 150K)
image

We also tried training the GAN using the XLRS model as an encoder using English Librispeech 100 dataset, and it also does not converge well (with hyper parameter range in paper and recipe). The valididation weighted PPL does not fall below ~60.
( it is not decreasing until 150K)
image
image
image

What's your environment?

  • fairseq Version (e.g., 1.0 or main): 1.0.0a0+741fd13
  • PyTorch Version (e.g., 1.0): 1.8.0+cu111
  • OS (e.g., Linux): ubuntu 20.04
  • How you installed fairseq (pip, source): git clone https://github.com/pytorch/fairseq.git && cd fairseq && git reset --hard 741fd13
  • Build command you used (if compiling from source): python3 -m pip install --editable ./
  • Python version: 3.8.10
  • CUDA/cuDNN version: cuda_11.1
  • GPU models and configuration: A100 x 8
  • Any other relevant information:
@lsrami
Copy link

lsrami commented Jan 2, 2022

@KimJeongSun Hello, I have the same problem with wav2vec-u experiment on the Mandarin data set. I guess it may be related to GAN's training hyperparameter settings, but I have adjusted the hyperparameters many times and have not achieved reasonable results. If you have new progress, please share it.

@stale
Copy link

stale bot commented Apr 16, 2022

This issue has been automatically marked as stale. If this issue is still affecting you, please leave any comment (for example, "bump"), and we'll keep it open. We are sorry that we haven't been able to prioritize it yet. If you have any new additional information, please include it with your comment!

@stale stale bot added the stale label Apr 16, 2022
@stale
Copy link

stale bot commented May 1, 2022

Closing this issue after a prolonged period of inactivity. If this issue is still present in the latest release, please create a new issue with up-to-date information. Thank you!

@stale stale bot closed this as completed May 1, 2022
@XR1988
Copy link

XR1988 commented Dec 19, 2024

Thanks for your work. What's the current status? I'm not getting good results; my UER is stuck around 90.
I've cloned this repo (it might have environment problems new): https://github.com/oneapi-src/ai-transcribe
Others cloned it with a virtual environment: https://github.com/voidful/wav2vec-u-exp
I'm having trouble with this: #5572
@KimJeongSun

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants