Add VoxConverse recipe #1142

flyingleafe · 2023-09-13T06:27:41Z

From here:

"VoxConverse is an audio-visual diarisation dataset consisting of multispeaker clips of human speech, extracted from YouTube videos.
Updates and additional information about the dataset can be found at our website (https://www.robots.ox.ac.uk/~vgg/data/voxconverse/index.html)."

Note: The default dev/test split is quite weird - namely, the test set is larger than the dev set... Hence, there is an option to use "dev" set as "train", and split "test" set in half into "dev" and "test", which can be disabled in the recipe.

flyingleafe · 2023-09-13T07:01:16Z

@pzelasko ^ quite weird that a random test for a particular python version, totally irrelevant to the changes in the PR, fails

pzelasko · 2023-09-13T12:43:09Z

@pzelasko ^ quite weird that a random test for a particular python version, totally irrelevant to the changes in the PR, fails

Some tests depend on RNG and once in a while they get flaky. It's happening a bit too often though, so I'm trying to fix that in #1143

pzelasko · 2023-09-13T12:49:54Z

lhotse/recipes/voxconverse.py

+def prepare_voxconverse(
+    corpus_dir: Pathlike,
+    output_dir: Optional[Pathlike] = None,
+    split_test: bool = True,  # test part is larger than dev part - split it into dev and test by default


Is this a standard thing to do with this dataset? If not, it would be better to return the splits as defined by the creators by default.

flyingleafe · 2023-09-14T04:10:58Z

@pzelasko yes, you're right, probably should not do the resplit by default, changed that

pzelasko

Thanks!

Add VoxConverse recipe

5a4f7b7

pzelasko reviewed Sep 13, 2023

View reviewed changes

pzelasko and others added 2 commits September 13, 2023 12:47

Merge branch 'master' into voxconverse-recipe

8d0a6cd

Do not resplit the dataset by default

0615651

flyingleafe force-pushed the voxconverse-recipe branch from 93d264e to 0615651 Compare September 14, 2023 04:10

pzelasko added this to the v1.17 milestone Sep 14, 2023

pzelasko approved these changes Sep 14, 2023

View reviewed changes

pzelasko merged commit 1389de4 into lhotse-speech:master Sep 14, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add VoxConverse recipe #1142

Add VoxConverse recipe #1142

flyingleafe commented Sep 13, 2023

flyingleafe commented Sep 13, 2023

pzelasko commented Sep 13, 2023

pzelasko Sep 13, 2023

flyingleafe commented Sep 14, 2023

pzelasko left a comment

Add VoxConverse recipe #1142

Add VoxConverse recipe #1142

Conversation

flyingleafe commented Sep 13, 2023

flyingleafe commented Sep 13, 2023

pzelasko commented Sep 13, 2023

pzelasko Sep 13, 2023

Choose a reason for hiding this comment

flyingleafe commented Sep 14, 2023

pzelasko left a comment

Choose a reason for hiding this comment