-
Notifications
You must be signed in to change notification settings - Fork 223
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add VoxConverse recipe #1142
Add VoxConverse recipe #1142
Conversation
@pzelasko ^ quite weird that a random test for a particular python version, totally irrelevant to the changes in the PR, fails |
lhotse/recipes/voxconverse.py
Outdated
def prepare_voxconverse( | ||
corpus_dir: Pathlike, | ||
output_dir: Optional[Pathlike] = None, | ||
split_test: bool = True, # test part is larger than dev part - split it into dev and test by default |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this a standard thing to do with this dataset? If not, it would be better to return the splits as defined by the creators by default.
93d264e
to
0615651
Compare
@pzelasko yes, you're right, probably should not do the resplit by default, changed that |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks!
From here:
"VoxConverse is an audio-visual diarisation dataset consisting of multispeaker clips of human speech, extracted from YouTube videos.
Updates and additional information about the dataset can be found at our website (https://www.robots.ox.ac.uk/~vgg/data/voxconverse/index.html)."
Note: The default dev/test split is quite weird - namely, the test set is larger than the dev set... Hence, there is an option to use "dev" set as "train", and split "test" set in half into "dev" and "test", which can be disabled in the recipe.