Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

preprocessing VoxCele2 is not working #488

Closed
amintavakol opened this issue Aug 12, 2020 · 5 comments
Closed

preprocessing VoxCele2 is not working #488

amintavakol opened this issue Aug 12, 2020 · 5 comments

Comments

@amintavakol
Copy link

While running encoder_preprocess on voxceleb2 dataset, I'm getting the following warning and nothing else happens. Not sure why?

raw: Preprocessing data for 5994 speakers.
raw:   0%|                                                                                           | 0/5994 [00:00<?, ?speakers/s]
/home/amin/.local/lib/python3.6/site-packages/librosa/core/audio.py:161: UserWarning: PySoundFile failed. Trying audioread instead.
  warnings.warn('PySoundFile failed. Trying audioread instead.')
/home/amin/.local/lib/python3.6/site-packages/librosa/core/audio.py:161: UserWarning: PySoundFile failed. Trying audioread instead.
  warnings.warn('PySoundFile failed. Trying audioread instead.')
@ghost
Copy link

ghost commented Aug 12, 2020

Could it be related to this? #76 (comment)

Also you might want to follow #458 since we're also running into issues with preprocessing and training. If you modify code to fix a problem would you please contribute it back as a pull request?

@mbdash
Copy link
Collaborator

mbdash commented Aug 12, 2020

I have mesed around with LibriSpeeech, VoxCeleb 1&2, CommonVoice and VCTK in #458

I am a bit tired so I am not sure if VoxCeleb was wav, ensure it is all converted to wav.
Also, once I succeed in training, a new encoder, I will share everything i can.

@amintavakol
Copy link
Author

amintavakol commented Aug 14, 2020

For guys who are trying to preprocess VoxCeleb2:
once you download the dataset, the audio files are in ".m4a" format. You guys need to reformat the audio files into ".wav".
Just put the following code snippet convert.sh (need to save it as .sh) in the root directory of the data (e.g. <path-to-VoxCeleb2>/raw/dev/aac)
convert.txt

then run
./convert.sh

Also, make sure you have ffmpeg installed on you machine.
You also need to modify the function preprocess_voxceleb2 in encode/preprocess.py, and change the extension to ".wav".

I'm training a new encoder with more datasets other than Libri/Vox1, 2 and will update everyone in a few days.

@ghost
Copy link

ghost commented Aug 19, 2020

Presumed resolved based on #497

@ghost ghost closed this as completed Aug 19, 2020
@ghost
Copy link

ghost commented Aug 19, 2020

Please share updates about your encoder model in #458, would be interested to see how it is working. Did you modify any hparams?

This issue was closed.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants