Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix for trim_to_alignments issue #1193

Merged
merged 60 commits into from
Nov 5, 2023
Merged

Conversation

desh2608
Copy link
Collaborator

Ref: #1186

The key issue was that in a couple of places, we did not account for the fact that start time of AlignmentItem is w.r.t. the start of the recording, whereas that for SupervisionSegment is w.r.t. the Cut.

  • To create supervision segment from an alignment item, we need to subtract the start time of the cut.
  • In with_offset function for the supervision, alignment start times do not change, since they are w.r.t. the recording.

desh2608 added 30 commits April 20, 2023 11:26
@desh2608 desh2608 linked an issue Oct 18, 2023 that may be closed by this pull request
@desh2608 desh2608 added this to the v1.18 milestone Oct 18, 2023
@@ -363,15 +363,15 @@ def test_cut_trim_to_supervisions_keep_overlapping_extend(mono_cut):
# Extended on the right side only by (4.0 - 3.37) / 2 == 0.315;
# the left side is capped by the start of the recording.
assert len(c1.supervisions) == 1
assert c1.start == 0.0
assert c1.start == 1.0
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

After changing the mono_cut fixture, this test case no longer represents the illustrated scenario. Also the comment above seems to be in conflict with the test outcome (looks like c1 should have 4 seconds now that it's not capped by the start of the recording anymore)

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good point. I'll fix the tests tomorrow.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sorry I lost track of this. Have made the appropriate changes now --- should not affect the older tests.

@desh2608 desh2608 requested a review from pzelasko November 2, 2023 19:14
@pzelasko pzelasko enabled auto-merge (squash) November 5, 2023 13:29
@pzelasko pzelasko merged commit db40bc4 into lhotse-speech:master Nov 5, 2023
9 of 10 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

trim-to-alignments produces negative utterance duration on libriheavy
2 participants