Skip to content

Reco2dur fix#2531

Merged
danpovey merged 2 commits intokaldi-asr:masterfrom
pswietojanski:reco2dur_fix
Jul 3, 2018
Merged

Reco2dur fix#2531
danpovey merged 2 commits intokaldi-asr:masterfrom
pswietojanski:reco2dur_fix

Conversation

@pswietojanski
Copy link
Contributor

@pswietojanski pswietojanski commented Jul 3, 2018

I believe this is a fix to this PR: https://github.com/kaldi-asr/kaldi/pull/2414/files

It originally added reco2dur bits to speed perturbation scripts (so perturb_data_dir_speed_3way.sh takes care of that, and then reverberation scripts do not need to recompute durations at the recording level).

The issue is, that although perturb_data_dir_speed_3way.sh computes reco2dur for the source folder (if it's missing), perturb_data_dir_speed.sh is not modifying it for speed perturbed variants. As a result reco2dur does not exists in speed perturbed folders, and combine_data.sh later omits it when creating _sp data dir. Which is a problem for reverberate_data_dir.py later (as it tries to recompute it from scratch).

@danpovey
Copy link
Contributor

danpovey commented Jul 3, 2018

Looks plausible. It's been tested somehow?

@pswietojanski
Copy link
Contributor Author

yes, I am doing some large scale experiments on multi_en, and this issue was very evident. I did test the fix today and seem to work as expected.

@danpovey danpovey merged commit be7c1aa into kaldi-asr:master Jul 3, 2018
@pswietojanski pswietojanski deleted the reco2dur_fix branch July 3, 2018 22:16
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants