Skip to content

Conversation

@jtrmal
Copy link
Contributor

@jtrmal jtrmal commented Mar 21, 2018

due working with several channels having the same transcription (several microphone recording the same conversation), the training data look very artificial from the POV of LM methods, so we have to remove the duplicate transcriptions. This script that does this wasn't modified after changing the format of the utterance ids.

@danpovey danpovey merged commit 22fbdd9 into kaldi-asr:master Mar 21, 2018
@kamo-naoyuki
Copy link

@jtrmal
Copy link
Contributor Author

jtrmal commented Apr 3, 2018 via email

LvHang pushed a commit to LvHang/kaldi that referenced this pull request Apr 14, 2018
Skaiste pushed a commit to Skaiste/idlak that referenced this pull request Sep 26, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants