Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Recipe] VoxPopuli #1089

Merged
merged 26 commits into from
Jul 19, 2023
Merged

[Recipe] VoxPopuli #1089

merged 26 commits into from
Jul 19, 2023

Conversation

desh2608
Copy link
Collaborator

No description provided.

import torch
import torchaudio
from torch.hub import download_url_to_file
from torchaudio.datasets.utils import _extract_tar
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this fails in CI but I think we have our own util for that (lhotse.utils.safe_extract)

@pzelasko
Copy link
Collaborator

IMO LGTM after fixing and if somebody wants to add s2s mode later, they can add it.

@desh2608
Copy link
Collaborator Author

IMO LGTM after fixing and if somebody wants to add s2s mode later, they can add it.

Yeah, I am already using the ASR prep part for some of the projects at JSALT. I like our method more than the official data preparation since it does not duplicate all the segments. I will fix the extract and then maybe we can merge.

@desh2608 desh2608 marked this pull request as ready for review July 19, 2023 13:34
@desh2608 desh2608 added this to the v1.16 milestone Jul 19, 2023
@pzelasko pzelasko enabled auto-merge (squash) July 19, 2023 16:26
@pzelasko pzelasko merged commit 10e7191 into lhotse-speech:master Jul 19, 2023
@desh2608 desh2608 deleted the recipe/voxpopuli branch November 2, 2023 19:17
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants