-
Notifications
You must be signed in to change notification settings - Fork 225
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix load_kaldi_dara_dir not loading segments and feats.scp correctly #1005
Conversation
Will look today
Y.
…On Wed, Mar 22, 2023 at 11:44 Piotr Żelasko ***@***.***> wrote:
Thanks! I think it looks good, but would be great if we could get a second
pair of eyes that's more familiar with Kaldi compatibility code CC
@desh2608 <https://github.com/desh2608> @jtrmal
<https://github.com/jtrmal> could one of you also check it?
—
Reply to this email directly, view it on GitHub
<#1005 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ACUKYX6REBZRZMPAO2E7IQTW5MM5LANCNFSM6AAAAAAWDAT67I>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
I think this is fine as a temporary solution, but as you mentioned, the |
Please fix the style issues. |
Due to hard coded start time in load_kaldi_data_dir, cut sets created from segments and feats.scp can have empty supervisions. To fix this issue, resolve start time and duration of features when both segments and feats.scp are available. Supervisions inherit duration computed from feats.scp and Features inherit start time from segments.
looks ok but it would be add test case for the functionality |
@yasumori could you add a unit test as well? We have some tests for Kaldi export/import so hopefully it wouldn't be excessively difficult. |
@pzelasko Yes, I'll push a new commit adding a test soon |
* test/fixtures/mini_librispeech2 is a subset of 3 utterances from test/fixtures/mini_librispeech and contains mfcc feature arks for testing. * Start time of lbi-3536-23268-0000 is deliberately changed to 1.0 in segments to test whether a Feature object inherits start time from segments when feats.scp and segments are both available using load_kaldi_data_dir.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks, LGTM
thanks @yasumori
…On Wed, Mar 22, 2023 at 7:51 PM Piotr Żelasko ***@***.***> wrote:
Merged #1005 <#1005> into
master.
—
Reply to this email directly, view it on GitHub
<#1005 (comment)>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/ACUKYXYDYJRKLSB332P6FKLW5OGADANCNFSM6AAAAAAWDAT67I>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
Fix #987
mat_shape
inutt_id_to_start_and_duration
if callingkaldi_native_io.MatrixShape.read(ark)
twice is an overhead.