Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
41 commits
Select commit Hold shift + click to select a range
ebe5e8d
[egs] Bug fix in train_raw_dnn.py
vimalmanohar Sep 26, 2017
fbedee0
steps/cleanup: Fixed corner case in resolve_ctm_edits_overlaps.py
vimalmanohar Nov 1, 2017
fe7d835
Merge branch 'master' of github.com:kaldi-asr/kaldi
vimalmanohar Nov 2, 2017
ada93ca
Merge branch 'master' of github.com:kaldi-asr/kaldi
vimalmanohar Nov 3, 2017
f0627cf
bn: Adding BN recipe
vimalmanohar Jan 6, 2017
6e73dec
bn: Add 1999 BN eval preparation
vimalmanohar Jan 6, 2017
917a670
bn: Add more data preparation scripts
vimalmanohar Jan 11, 2017
d275480
bn: Fix MFCC config
vimalmanohar Jan 11, 2017
4f94a5c
bn: Clean and update recipe
vimalmanohar Jan 11, 2017
eb6fccb
bn: Remove local/lm/text_normalization.py
vimalmanohar Jan 11, 2017
351b447
bn: Fix normalize_transcripts
vimalmanohar Jan 11, 2017
643881e
bn: Updated recipe to add more LM corpora
vimalmanohar Jan 12, 2017
6f316ef
bn: Updating main recipe
vimalmanohar Jan 12, 2017
cc9752c
bn: Minor fixes in BN recipe
vimalmanohar Mar 23, 2017
634d030
HUB4 train preparation scripts
vimalmanohar Nov 3, 2017
e6242ed
bn: Bug fixes and create new scripts
vimalmanohar Nov 3, 2017
560929c
Minor modifications
vimalmanohar Nov 3, 2017
e1dd79e
bn: Modifying some preparation scripts
vimalmanohar Nov 7, 2017
35c9820
bn: Updating bn recipe
vimalmanohar Nov 8, 2017
a528552
bn: Remove some unused scripts
vimalmanohar Nov 8, 2017
9f61a1b
bn: rename bn to hub4_english
vimalmanohar Nov 21, 2017
acbcc2c
bn: Adding patch instead of copying corpus files
vimalmanohar Nov 27, 2017
5edf464
bn: Cleaning up
vimalmanohar Nov 27, 2017
fbcfa55
bn: Adding some results
vimalmanohar Nov 27, 2017
2fe21ad
bn: Remove some options
vimalmanohar Nov 27, 2017
35ade87
bn: Adding more comments and cleaning up scripts
vimalmanohar Nov 28, 2017
b7c0c3b
Adding results etc.
vimalmanohar Nov 28, 2017
8776b5b
bn: adding stages
vimalmanohar Nov 28, 2017
191ae0a
bn: Various bug fixes
vimalmanohar Dec 5, 2017
450def0
Fix some normalization issues
vimalmanohar Dec 27, 2017
148c060
Not removing hyphens for partial words
vimalmanohar Dec 28, 2017
3c96a5f
Minor bug fixes
vimalmanohar Feb 1, 2018
8aeec32
Minor fixes to make the recipe work
vimalmanohar Feb 5, 2018
a9e1668
Adding results
vimalmanohar Feb 5, 2018
0e6c8de
Merge branch 'master' of github.com:vimalmanohar/kaldi into bn
vimalmanohar Feb 6, 2018
427c38a
Fixing some bugs in segment long utterances
vimalmanohar Feb 8, 2018
cd7e12a
Making changes based on comments
vimalmanohar Feb 8, 2018
befd7ee
Update results and remove multiple versions of script
vimalmanohar Feb 11, 2018
f846849
Fixing some issues based on comments
vimalmanohar Feb 12, 2018
da53367
Fixing train_g2p.sh
vimalmanohar Feb 12, 2018
f53a18a
Fixing some lm scripts
vimalmanohar Feb 13, 2018
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
33 changes: 33 additions & 0 deletions egs/hub4_english/s5/README
Original file line number Diff line number Diff line change
@@ -0,0 +1,33 @@
This is the English Broadcast News (HUB4) corpus.

1996 English Broadcast News Train (HUB4)
Speech LDC97S44
Transcripts LDC97T22

1997 English Broadcast News Train (HUB4)
Speech LDC98S71
Transcripts LDC98T28

1995 English Broadcast News (CSR-IV HUB4)
LDC96S31

North American News Text Corpus
LDC95T21

North American News Text Supplement Corpus
LDC98T30

1996 CSR HUB4 Language Model
LDC98T31

1996 English Broadcast News Dev and Eval (HUB4)
LDC97S66

1997 HUB4 English Evaluation corpus
LDC2002S11

1998 HUB4 Broadcast News Evaluation English Test Material
LDC2000S86

1999 HUB4 Broadcast News Evaluation English Test Material
LDC2000S88
9 changes: 9 additions & 0 deletions egs/hub4_english/s5/RESULTS
Original file line number Diff line number Diff line change
@@ -0,0 +1,9 @@
for x in exp/*/decode*; do grep Sum $x/score*/*.ctm.*sys | utils/best_wer.sh ; done | sort -k2,2n
exit 0

%WER 17.8 | 728 32834 | 84.1 11.8 4.1 1.9 17.8 82.8 | exp/tri4/decode_nosp_eval97.pem_rescore/score_13_0.5/eval97.pem.ctm.filt.sys
%WER 19.0 | 728 32834 | 83.0 12.7 4.3 2.0 19.0 84.2 | exp/tri4/decode_nosp_eval97.pem/score_13_0.0/eval97.pem.ctm.filt.sys
%WER 19.4 | 728 32834 | 82.7 13.1 4.2 2.1 19.4 83.8 | exp/tri3/decode_nosp_eval97.pem_rescore/score_13_0.0/eval97.pem.ctm.filt.sys
%WER 20.5 | 728 32834 | 81.7 13.9 4.4 2.3 20.5 85.0 | exp/tri3/decode_nosp_eval97.pem/score_13_0.0/eval97.pem.ctm.filt.sys
%WER 23.7 | 728 32834 | 79.0 16.0 5.0 2.7 23.7 85.3 | exp/tri4/decode_nosp_eval97.pem.si/score_12_0.0/eval97.pem.ctm.filt.sys
%WER 25.7 | 728 32834 | 77.1 17.6 5.3 2.8 25.7 85.9 | exp/tri3/decode_nosp_eval97.pem.si/score_13_0.0/eval97.pem.ctm.filt.sys
14 changes: 14 additions & 0 deletions egs/hub4_english/s5/cmd.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,14 @@
# you can change cmd.sh depending on what type of queue you are using.
# If you have no queueing system and want to run on a local machine, you
# can change all instances 'queue.pl' to run.pl (but be careful and run
# commands one by one: most recipes will exhaust the memory on your
# machine). queue.pl works with GridEngine (qsub). slurm.pl works
# with slurm. Different queues are configured differently, with different
# queue names and different ways of specifying things like memory;
# to account for these differences you can create and edit the file
# conf/queue.conf to match your queue's configuration. Search for
# conf/queue.conf in http://kaldi-asr.org/doc/queue.html for more information,
# or search for the string 'default_config' in utils/queue.pl or utils/slurm.pl.

export train_cmd="queue.pl --mem 1G"
export decode_cmd="queue.pl --mem 4G"
1 change: 1 addition & 0 deletions egs/hub4_english/s5/conf/mfcc.conf
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
--use-energy=false # only non-default option.
2 changes: 2 additions & 0 deletions egs/hub4_english/s5/conf/vad.conf
Original file line number Diff line number Diff line change
@@ -0,0 +1,2 @@
--vad-energy-threshold=5.5
--vad-energy-mean-scale=0.5
Loading