Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add trim_to_supervision_groups method #930

Merged
merged 2 commits into from
Jan 2, 2023

Conversation

desh2608
Copy link
Collaborator

@desh2608 desh2608 commented Dec 19, 2022

This PR adds another way of trimming a cut set --- by trimming to "supervision groups". A supervision group (or an utterance group, see Fig. 2 in this paper) is defined as a group of contiguous supervisions that are either overlapping or separated by a small pause (shorter than some max_pause threshold. Such a way of trimming is useful for evaluating multi-talker ASR.

For example, the following cut::

                                        Cut
╔═════════════════════════════════════════════════════════════════════════════════╗
║┌──────────────────────┐                              ┌────────┐                 ║
║│ Hello this is John.  │                              │   Hi   │                 ║
║└──────────────────────┘                              └────────┘                 ║
║            ┌──────────────────────────────────┐            ┌───────────────────┐║
║            │     Hey, John. How are you?      │            │  What do you do?  │║
║            └──────────────────────────────────┘            └───────────────────┘║
╚═════════════════════════════════════════════════════════════════════════════════╝

is transformed into two cuts::

                    Cut 1                                       Cut 2
╔════════════════════════════════════════════════╗    ╔═══════════════════════════╗
║┌──────────────────────┐                        ║    ║┌────────┐                 ║
║│ Hello this is John.  │                        ║    ║│   Hi   │                 ║
║└──────────────────────┘                        ║    ║└────────┘                 ║
║            ┌──────────────────────────────────┐║    ║      ┌───────────────────┐║
║            │     Hey, John. How are you?      │║    ║      │  What do you do?  │║
║            └──────────────────────────────────┘║    ║      └───────────────────┘║
╚════════════════════════════════════════════════╝    ╚═══════════════════════════╝

@pzelasko
Copy link
Collaborator

pzelasko commented Jan 2, 2023

Thanks!

@pzelasko pzelasko merged commit 2e6adb9 into lhotse-speech:master Jan 2, 2023
@pzelasko pzelasko added this to the v1.12 milestone Jan 2, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants