Merged
157 commits
6f4fafb
[src] Bug fix: oscillating cluster threshold in agglomerative-cluster…
mmaciej2 Apr 3, 2018
fe36823
[src] Fix bug causing wrong learning rate in RNNLM training with --st…
nshmyrev Apr 3, 2018
831dd38
[src] make fst templates inline to eliminate linking errors in other …
chenzhehuai Apr 3, 2018
7436ed6
[src] Make it so no threads are used if you set num-threads to 0.
sikoried Apr 3, 2018
a9e4f74
[build] Fix some problems with sequitur installation (#2337)
jtrmal Apr 4, 2018
1814278
[scripts] Make sure nj in get_utt2dur.sh does not exceed num-utts. (t…
danpovey Apr 4, 2018
5294666
[scripts] Fix to nnet3 bug RE per-utt splitting that appeared after #…
danpovey Apr 5, 2018
1a1e265
[src,build] Fix cygwin compatibility; remove unnecessary memalign mac…
qbetterk Apr 5, 2018
60862b0
[scripts] Make nnet3 logging output look more like chain logging outp…
Ore-an Apr 6, 2018
c2d796d
[scripts] Fix bug in chain/get_egs.sh affecting semi-supervised train…
vimalmanohar Apr 6, 2018
c643295
[egs] IAM script fixes using the right lang for training/lattice gene…
hhadian Apr 7, 2018
c0c55ad
[egs] IAM: add chain recipe starting with e2e alignments and tree (#2…
hhadian Apr 8, 2018
9b90cba
[egs] Fix scoring issue in IAM (#2346)
hhadian Apr 9, 2018
e10c478
[src] Revert KALDI_MEMALIGN->aligned_alloc change from #2304, fixes #…
danpovey Apr 10, 2018
8a4ab19
[src] Fix nnet3-copy-egs bug for multiling. non-chain training (thx:J…
danpovey Apr 11, 2018
ad48342
[scripts] Bug fix in combine_egs.sh (for multilingual training) (#2351)
vimalmanohar Apr 11, 2018
2d0cd79
[scripts] Finish removing transform_dir options from nnet3 training (…
danpovey Apr 12, 2018
adcabe4
[src] Fix (rare) divergence from orthogonal constraint. Thx: Sergey …
danpovey Apr 12, 2018
ccd50e2
[egs] Adding Wellington corpus for LM in IAM OCR (#2354)
ChunChiehChang Apr 13, 2018
e99de50
[scripts] Make beam configurable, +default smaller, in pruned RNNLM …
hainan-xv Apr 13, 2018
535df00
[src] small fix to comment (#2357)
Apr 14, 2018
dd7caa6
[egs] A fix to second-stage chain recipes in IAM and lang fixes in UW…
hhadian Apr 14, 2018
dd1640b
[egs] Fix script bug in IAM setup
adelra Apr 16, 2018
4abdb5b
[src] Cosmetic code fixes in mel-computations.cc (#2359)
Apr 16, 2018
5a48ad4
[scripts] Fix to get_saturation.pl, to regex bug introduced around Ja…
danpovey Apr 19, 2018
8f6bb2d
[egs,scripts] Add factorized TDNN training recipe for multi-en; fix p…
xiaohui-zhang Apr 19, 2018
1331a35
[egs] Make Wellington Corpus optional in iam_ocr (#2370)
ChunChiehChang Apr 19, 2018
5318100
[src,egs] Fix implementation of l2-regularize with backstitch; add am…
freewym Apr 20, 2018
3040a0a
[egs] Fix commonvoice run.sh (source utils/parse_options.sh to enable…
entn-at Apr 20, 2018
9692a61
[src] Fix bug in matrix compression RE stride type. Thx:@LvHang (#2376)
danpovey Apr 20, 2018
6bd7ca7
[src] Decoder fix to bug affecting unusual FST types (#2377)
mike-clark-8192 Apr 20, 2018
4fda1b0
[src] Comment fix to transition-model.h (#2378)
Apr 22, 2018
8ff7fd9
[scripts] Added utils/prepare_extended_lang.sh (extending vocab of a …
xiaohui-zhang Apr 24, 2018
f0333bb
[src,scripts,egs] Support for unconstrained-egs for chain training +…
hhadian Apr 24, 2018
d25b3ef
[src] Add a cast to prevent errors from some compilers (#2384)
danijel3 Apr 25, 2018
156ec6c
[egs] Semisupervised training recipes: bug fix regarding context info…
vimalmanohar Apr 25, 2018
6a2dce8
[egs] Remove accidentally committed "exit 0" in WSJ run.sh (#2386)
Apr 25, 2018
032c026
[src,scripts] Cosmetic fixes.
danpovey Apr 26, 2018
5eaf49f
[scripts] add wordlists mismatch test for rnnlm rescoring (#2387)
hainan-xv Apr 26, 2018
c0ef356
[scripts] Clarify usage of diarization make_rttm.py (#2390)
mmaciej2 Apr 27, 2018
fb54444
[egs] Add speaker verification recipe for the VoxCeleb2 corpus (#2380)
entn-at Apr 27, 2018
bf40d61
[src] Remove un-needed code. Thanks: anand@sayint.ai. (#2393)
huangruizhe Apr 29, 2018
5bac98e
[egs] Update nnet3/chain scripts for HKUST example (#2368)
Apr 29, 2018
e15f3a1
[scripts] Fix in steps/nnet3/get_egs.sh to avoid failure due to broke…
freewym May 1, 2018
1e2a0fd
[egs] Cleanup and update results in IAM OCR scripts (#2395)
ChunChiehChang May 1, 2018
64025aa
[scripts] rnnlm scripts: ignore first iteration while looking for the…
nshmyrev May 2, 2018
6a32fbd
[scripts] Fix path.sh bug in vad_to_segments.sh (#2402)
nshmyrev May 3, 2018
20b2a45
[src] Remove unnecessary file chain/context-dep-topology.h (#2401)
huangruizhe May 3, 2018
080129e
[src,scripts,egs] x-vectors for diarization (#2391)
david-ryan-snyder May 4, 2018
4edec5d
[egs,scripts,src] Add IFN/ENIT dataset Handwriting Recognition Exampl…
babakrkb May 5, 2018
f9b00e9
[src] Some small cosmetic changes (#2404)
danpovey May 6, 2018
e18191c
[egs] Fix python error by moving __future__ import to top (#2405)
entn-at May 7, 2018
954e69a
[src] make e2e/"unconstrained" numerator computation faster (#2392)
jtrmal May 7, 2018
b03b641
[src] Speed up VectorBase<Real>::Sum() by using BLAS (#2394)
galv May 8, 2018
08b47be
[build] update dockerfile for Ubuntu and cuda (#2408)
sih4sing5hong5 May 9, 2018
90363ea
[egs] Update VoxCeleb Recipe (#2403)
david-ryan-snyder May 11, 2018
bce4336
[egs] Speeding up i-vector training in voxceleb v1 recipe (#2421)
david-ryan-snyder May 14, 2018
ff0da26
[scripts] Add util to compute recordings durations before data pertur…
Ore-an May 14, 2018
f8b678a
[scripts] Update to decode_semisup.sh (copy input model) (#2425)
vimalmanohar May 15, 2018
108832d
[egs] Madcat Arabic handwritten text line recognition (#2356)
aarora8 May 17, 2018
2ad8d78
[egs] Minor fixes to x-vector based recipes (#2426)
entn-at May 18, 2018
fd12973
[egs] update iban tdnn CE and chain recipes (#2429)
freewym May 18, 2018
558f010
[scripts] Add a warning message to nnet3/align.sh about using GPUs. (…
hhadian May 19, 2018
f31c1e9
[src,scripts] Support word insertion penalty when getting ctm (#2433)
ronggong May 21, 2018
264a6a3
[scripts] Fix fix_data_dir.sh bug introduced in #2414; thanks, @Fhroz…
danpovey May 21, 2018
b1be44e
[scripts] Adding script to compute minDCF for speaker recognition (#…
david-ryan-snyder May 21, 2018
1ab3df0
[egs] Madcat arabic: easier configuration of data locations; fix wer_…
aarora8 May 22, 2018
72d89ce
[egs] egs/tedlium/s5_r2 download Tedlium LMs from kaldi-asr.org (#2441)
david-ryan-snyder May 22, 2018
182f382
[src] Fix bug in rnnlm-sentence-probs (failure in non-debug builds) (…
hainan-xv May 23, 2018
7051384
[egs] Fix affix in run_tdnn_7p recipe (swbd) (#2446)
hhadian May 24, 2018
447e964
[egs] Add recipes for Speakers in the Wild (SITW) (#2422)
david-ryan-snyder May 24, 2018
b1ae952
[scripts,egs] Support averaging forward and backward RNNLMs (#2436)
hainan-xv May 24, 2018
6b94eb7
[scripts] Add retry.pl, to be used when queue is flaky. (#2450)
danpovey May 24, 2018
e3556a0
[src] Add missing 'fst::Connect() in parallel determinization; fixes …
KarelVesely84 May 25, 2018
e0311e0
[scripts] Fix bug in retry.pl. (#2453)
danpovey May 25, 2018
9b23b17
[egs] Fix a small bug in swbd 7p recipe (was not decoding some test s…
hhadian May 25, 2018
d6d49d0
[scripts] Apply minimum segment duration in vad_to_segments.sh (#2447)
francoishernandez May 26, 2018
7ffc9dd
[scripts] Make retry.pl deal correctly with keyboard interrupts; make…
danpovey May 26, 2018
d23cad8
[scripts] Allow floating-point cv_spk_percent in nnet1 scripts (#2459)
megazone87 May 29, 2018
3f4f425
[egs] Fix TDNN example scripts to respect given train_cmd (#2460)
stanleyguan May 29, 2018
e014051
[src] Fix bug in lattice-oracle relating to wildcards in lattice outp…
dogancan May 30, 2018
a486fa1
[scripts] Fix harmless bug in chain/get_egs.sh, thanks: teinhonglo@gm…
danpovey May 31, 2018
9f3aa11
[scripts] Fix failure in make_bottleneck_features.sh (#2443)
jtrmal May 31, 2018
0fb7950
[egs] Fix to paths in iban recipe (#2467)
vimalmanohar May 31, 2018
fd545bb
[egs] bug fix in egs/sitw/v2/run.sh (#2465)
david-ryan-snyder Jun 1, 2018
6f5f778
[doc] Fix I/O documentation example (#2468)
emilmelnikov Jun 1, 2018
eef2a09
[scripts] Fix to steps/segmentation/internal/sad_to_segments.py (#2470)
Jun 1, 2018
e0e0bec
[egs] Add TDNN+OPGRU recipe for multi-en (#2469)
xiaohui-zhang Jun 2, 2018
c8db7a9
[src] TopSort lattice-oracle lattice output (#2463)
dogancan Jun 2, 2018
04b1f7d
[egs,scripts] Byte-pair encoding (BPE) applied MADCAT Arabic OCR (#2434)
babakrkb Jun 4, 2018
7659d8b
[egs] Change renorm to batchnorm in BABEL recipes (#2476)
jtrmal Jun 5, 2018
ce2eb39
[src,scripts] Add steps/nnet3/chain/get_phone_post.sh, to get phone p…
danpovey Jun 5, 2018
b38d263
[egs,scripts] Fix train/test data split in make_voxceleb1.pl, make s…
weichengcai Jun 6, 2018
28e4ee6
[scripts] Fix typo in validate_data_dir.sh causing failure if reco2du…
alumae Jun 6, 2018
5a6477b
[scripts] Improve encoding compatibility of rnnlm dataprep (#2455)
KarelVesely84 Jun 8, 2018
e1abd81
[build,src] Upgrade OpenFst from 1.6.5->1.6.7 (#2313)
jtrmal Jun 8, 2018
40757c2
[scripts,build] ban carriage-return (\r) in lang/; suppress 'ar' warn…
jtrmal Jun 9, 2018
0f954c2
[build] Update supported compilers for CUDA 9.2. (#2486)
LvHang Jun 9, 2018
8fae925
[build] Fix syntax error in configure script introduced in #2486 (#2488)
nanma Jun 10, 2018
4571f47
[scripts] Small bugfix in steps/cleanup/decode_segmentation.sh (#2489)
gooofy Jun 10, 2018
707baa0
[egs] updating results in voxceleb recipe (#2494)
david-ryan-snyder Jun 11, 2018
8fd2ba5
[src] Make it so very wide-context nnet3 models don't cause a crash. …
danpovey Jun 11, 2018
be96725
[scripts] Add script to limit <unk> history of ARPA files, for OOV-m…
olix20 Jun 12, 2018
1e49578
[build] change the build status badge location (#2497)
jtrmal Jun 13, 2018
52b2882
[src] Fix mistake in comment RE FFT dimensions (#2495)
Jun 13, 2018
ca239f2
[scripts] Make inline script in extract_ivectors.sh pyton3 compatible…
zhuohaochen Jun 13, 2018
b4eda57
[scripts] Make inline script in extract_ivectors.sh python3 compatibl…
zhuohaochen Jun 14, 2018
f49b3fc
[scripts,src] added a script to scale arcs which output <unk> in HCLG…
xiaohui-zhang Jun 15, 2018
bfd0d3c
[src] Fix compilation issue with DOUBLE_PRECISION flag (#2496) (#2505)
jhennrich Jun 15, 2018
9210f3a
[src] Speed improvement in pruned lattice-rescoring algorithm (#2504)
hainan-xv Jun 15, 2018
d2b1c55
[scripts] Make soft link from steps/chain to steps/nnet3/chain (#2507)
hhadian Jun 15, 2018
2fa70e3
[scripts,egs] Minor updates to WSJ-e2e recipes and scripts (#2506)
hhadian Jun 15, 2018
f86792e
[src] Remove unused binary 'extract-rows' (#2510)
KarelVesely84 Jun 18, 2018
598b177
[scripts] Bug fix to failure in chain/train.py when --trainer.optimi…
rcgale Jun 19, 2018
775c770
[src] Minor optimizations in "e2e" numerator code (#2508)
hhadian Jun 19, 2018
42a673a
[egs] Add recipe for AIShell2 (large Mandarin database) (#2398)
dophist Jun 20, 2018
d16ef4b
[src] Change smart ptr copy to move sematics, avoid cost of atomic in…
Lallapallooza Jun 20, 2018
60141df
[scripts] enable --trainer.input-model option for non-chain nnet3 tra…
naxingyu Jun 25, 2018
8ce3a95
[src] Make RNNLM training single threaded (workaround for CuSparse li…
hainan-xv Jun 27, 2018
e4908f0
[build] disable -march=native in travis build (#2511)
jtrmal Jun 29, 2018
7605fe5
[src] lattice-copy: fix to segfault with --include,--exclude options …
jfainberg Jun 29, 2018
c1d6719
[scripts] fixing bug present since #2515 in steps/nnet3/train_raw_{dn…
david-ryan-snyder Jul 1, 2018
a34aefd
[build] Fix typo in comment in src/configure (#2529)
mpuels Jul 2, 2018
8a27e86
[src] Add a check for utterance length when creating end2end egs (#2528)
hhadian Jul 2, 2018
be7c1aa
[scripts] Output the reco2dur file in speed-perturbation script (#2531)
pswietojanski Jul 3, 2018
ad93210
[egs] Fix to training list in egs/sitw{v1,v2} recipe (#2535)
david-ryan-snyder Jul 5, 2018
6edf1be
[src] Fix to nnet3-chain-get-egs corner-case bug discovered by Armand…
danpovey Jul 6, 2018
998a4d6
[egs] Some fixes and cleanup in Aishell2 scripts (#2522)
Jul 6, 2018
eacf34a
[src,scripts,egs] Updated TDNN-F recipes incorporating resnet skip co…
danpovey Jul 7, 2018
295877c
[scripts] Add decoding script that combines nnet3 outputs (#2534)
teinhonglo Jul 7, 2018
00bc59d
[egs] added evaluation scripts to chime-5 setup (#2521)
sw005320 Jul 9, 2018
147f0b5
[egs] Make validate_text.pl say more specifically which lines have ba…
danpovey Jul 10, 2018
09d6706
[egs] Fix to Swbd script to reflect what I actually ran
danpovey Jul 10, 2018
ae4292d
[egs] Fix bug in Chime5 eval script (#2542)
sw005320 Jul 11, 2018
6f2140b
[scripts] Fix to get_reco2dur.sh (thanks: Zhangfeifan) (#2544)
danpovey Jul 11, 2018
fdb6774
[egs] Add scripts for release 3 of Tedlium corpus (#2442)
francoishernandez Jul 12, 2018
2c1b11a
[src] Change decodable code so nnet context does not have to be recom…
danpovey Jul 15, 2018
bbba5ef
[scripts] Add missing quote in vad_to_segments.sh script (#2550)
nshmyrev Jul 15, 2018
79883f3
[scripts] add utility script compute_sentence_probs_arpa.py to comput…
DongjiGao Jul 17, 2018
654d66b
[scripts] BSD-compatibilty fix where the "-s" option of seq is used (…
LiChenda Jul 18, 2018
c029775
[src] Speed up AddDiagMat2 for very thin and tall matrices (#2555)
kangshiyin Jul 19, 2018
a630dd2
[egs] fixed a minor issue (wrong graph dir) in CommonVoice setup (#2563)
mdoulaty Jul 21, 2018
1d61f1c
[src,scripts,egs] Misc small fixes, including fix to TDNN-F results i…
danpovey Jul 22, 2018
8ae505e
[scripts] Fix to scripts/rnnlm/get_num_splits.sh RE large datasets (…
misbullah Jul 23, 2018
e6fe7e8
[scripts] Removing has_key() for python3 compatibility (see #2570) (#…
JRMeyer Jul 24, 2018
de03a75
[scripts] Fix an issue in BPE-related scripts (don't assume opt-sil i…
hhadian Jul 25, 2018
06604a8
[src] Updated cuda kernel to speedup AddDiagMatMat[kTrans, kNoTrans] …
kangshiyin Jul 25, 2018
1616653
[scripts] Fix to analyze_alignments/lats.sh (double-counting) (#2573)
eginhard Jul 25, 2018
97b78df
[egs] Update TDNN-F script for Librispeech (#2574)
danpovey Jul 26, 2018
5391f00
[src] Minor code-style fixes to context-dependency code (#2576)
Jul 26, 2018
d4d968c
[src] Add binary fsts-concat to concatenate Kaldi archives of FSTs (l…
KarelVesely84 Jul 26, 2018
b8fd2cd
[egs] Set the acoustic scale properly when generating e2e chain align…
hhadian Jul 27, 2018
ec71be4
[scripts] Fix typo in steps/cleanup/decode_fmllr_segmentation.sh (#2580)
MartinKocour Jul 27, 2018
5cc9731
[src] added configure support for cuda on arm64 (#2577) (#2578)
frpunzalan Jul 27, 2018
5b27111
[egs] Cosmetic fix in aishell recipes (#2582)
teinhonglo Jul 29, 2018
68c926a
[scripts] Call the right script when fixing/validating data directori…
hhadian Aug 1, 2018
04027ee
[src] fix embedding training bug for RNNLM without letter features (#…
hainan-xv Aug 2, 2018
2 changes: 2 additions & 0 deletions .gitignore
@@ -101,6 +101,8 @@ GSYMS
 /tools/openfst-1.6.2/
 /tools/openfst-1.6.5.tar.gz
 /tools/openfst-1.6.5/
+/tools/openfst-1.6.7.tar.gz
+/tools/openfst-1.6.7/
/tools/BeamformIt/
/tools/libsndfile-1.0.25.tar.gz
/tools/libsndfile-1.0.25/
3 changes: 2 additions & 1 deletion .travis.yml
@@ -21,6 +21,7 @@ addons:
 - gfortran-4.9
 - liblapack-dev
 - clang-3.8
+- sox

branches:
only:
@@ -47,7 +48,7 @@ script:
 # http://peter.eisentraut.org/blog/2014/12/01/ccache-and-clang-part-3/
 # for the explanation why extra switches needed for clang with ccache.
 - CXX="ccache clang++-3.8 -Qunused-arguments -fcolor-diagnostics -Wno-tautological-compare"
-CFLAGS="-march=native"
+CFLAGS=""
LDFLAGS="-llapack"
INCDIRS="$XROOT/usr/include"
LIBDIRS="$XROOT/usr/lib"
12 changes: 6 additions & 6 deletions COPYING
@@ -56,7 +56,7 @@ contributors and original source material as well as the full text of the Apache
License v 2.0 are set forth below.

Individual Contributors (in alphabetical order)

Mohit Agarwal
Tanel Alumae
Gilles Boulianne
@@ -123,7 +123,7 @@ Individual Contributors (in alphabetical order)
Haihua Xu
Hainan Xu
Xiaohui Zhang

Other Source Material

This project includes a port and modification of materials from JAMA: A Java
@@ -136,9 +136,9 @@ Other Source Material
"Signal processing with lapped transforms," Artech House, Inc., 1992. The
current copyright holder, Henrique S. Malvar, has given his permission for the
release of this modified version under the Apache License 2.0.
-This project includes material from the OpenFST Library v1.2.7 available at
-http://www.openfst.org and released under the Apache License v. 2.0.

+This project includes material from the OpenFST Library v1.2.7 available at
+http://www.openfst.org and released under the Apache License v. 2.0.

[OpenFst COPYING file begins here]

@@ -147,7 +147,7 @@ Other Source Material
You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
3 changes: 1 addition & 2 deletions README.md
@@ -1,5 +1,4 @@
-[![Build Status](https://travis-ci.org/kaldi-asr/kaldi.svg?branch=master)](https://travis-ci.org/kaldi-asr/kaldi)
-
+[![Build Status](https://travis-ci.com/kaldi-asr/kaldi.svg?branch=master)](https://travis-ci.com/kaldi-asr/kaldi)
Kaldi Speech Recognition Toolkit
================================

2 changes: 1 addition & 1 deletion egs/aishell/s5/local/aishell_train_lms.sh
@@ -23,7 +23,7 @@ kaldi_lm=`which train_lm.sh`
 if [ -z $kaldi_lm ]; then
 echo "$0: train_lm.sh is not found. That might mean it's not installed"
 echo "$0: or it is not added to PATH"
-echo "$0: Use the script tools/extra/install_kaldi_lm.sh to install it"
+echo "$0: Use the script tools/extras/install_kaldi_lm.sh to install it"
exit 1
fi
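
The guard in this hunk relies on `which` and an unquoted `$kaldi_lm`. A minimal sketch of the same check, assuming a hypothetical helper named `require_tool` (it is not part of the recipe), uses the POSIX `command -v` builtin and returns a status instead of exiting, so the caller decides what to do:

```shell
# Hypothetical helper (not in the Kaldi recipe): report whether a tool is on PATH.
require_tool() {
  tool="$1"
  # command -v prints the resolved path and returns non-zero when nothing is found.
  resolved=$(command -v "$tool" 2>/dev/null) || resolved=""
  if [ -z "$resolved" ]; then
    echo "$0: $tool is not found. That might mean it's not installed" >&2
    echo "$0: or it is not added to PATH" >&2
    return 1
  fi
  echo "$resolved"
}

# Example: 'sh' exists on any POSIX system, so this prints its path.
require_tool sh
```

In the recipe itself the equivalent call would be `require_tool train_lm.sh || exit 1`, keeping the original behaviour of stopping when the tool is missing.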

50 changes: 50 additions & 0 deletions egs/aishell2/README.txt
@@ -0,0 +1,50 @@
# AISHELL-2

AISHELL-2 is by far the largest free speech corpus available for Mandarin ASR research.
## 1. DATA
### training data
* 1000 hours of speech data (around 1 million utterances)
* 1991 speakers (845 male and 1146 female)
* clean recording environment (studio or quiet living room)
* read speech
* reading prompts from various domains: entertainment, finance, technology, sports, control commands, places of interest, etc.
* near-field recording via 3 parallel channels (iOS, Android, Microphone).
* iOS data is free for non-commercial research and education use (e.g. universities and colleges)

### evaluation data:
Currently we release AISHELL2-2018A-EVAL, containing:
* dev: 2500 utterances from 5 speakers
* test: 5000 utterances from 10 speakers

You can download the above evaluation set from:
http://www.aishelltech.com/aishell_eval

We may update and release other evaluation sets on the website later, targeting different applications and scenarios.

## 2. RECIPE
Based on the standard Kaldi system, AISHELL-2 provides a self-contained Mandarin ASR recipe, with:
* a word segmentation module, which is a must-have component for Chinese ASR systems
* an open-source Mandarin lexicon (DaCiDian)
* a simplified GMM training recipe
* an acoustic channel adaptation recipe (AM fine-tuning)

# CONTACT
The AISHELL foundation is a non-profit online organization, with members from the speech industry and research institutes.

We hope the AISHELL-2 corpus and recipe will be beneficial to the entire speech community.

Depending on your location and internet speed, we distribute the corpus in two ways:
* hard-disk delivery
* cloud-disk downloading

To apply for the AISHELL-2 corpus free of charge, you need to fill in a very simple application form, providing and confirming:
* your university department / educational institute information
* that the corpus is only for non-commercial research / education use

The AISHELL foundation covers all data distribution fees (including the corpus, hard-disk cost, etc.).

Data re-distribution inside your university department is OK for convenience. However, users are not supposed to re-distribute AISHELL-2 to other universities or education institutes.

To get the application form, or if you come across any problem with the recipe, contact us via:

aishell.foundation@gmail.com
6 changes: 6 additions & 0 deletions egs/aishell2/s5/RESULTS
@@ -0,0 +1,6 @@
%WER 44.78 [ 22176 / 49527, 370 ins, 2179 del, 19627 sub ] exp/mono/decode_test/cer_9_0.0
%WER 24.78 [ 12271 / 49527, 394 ins, 815 del, 11062 sub ] exp/tri1/decode_test/cer_11_0.0
%WER 22.54 [ 11165 / 49527, 390 ins, 665 del, 10110 sub ] exp/tri2/decode_test/cer_11_0.0
%WER 19.78 [ 9795 / 49527, 313 ins, 684 del, 8798 sub ] exp/tri3/decode_test/cer_13_0.0

# (Chain model results are at the beginning of corresponding scripts)
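
Each line above reports the error rate as (insertions + deletions + substitutions) divided by the number of reference tokens; the `cer_*` file names suggest the tokens here are characters. As a quick sanity check, the sketch below recomputes the `exp/mono` figure from the counts on its line. The variable names are illustrative only.

```shell
# Counts copied from the exp/mono line of RESULTS.
ins=370; del=2179; sub=19627; total=49527
errors=$((ins + del + sub))
# awk handles the floating-point division and the rounding to two decimals.
wer=$(awk -v e="$errors" -v n="$total" 'BEGIN { printf "%.2f", 100 * e / n }')
echo "%WER $wer [ $errors / $total ]"
# prints: %WER 44.78 [ 22176 / 49527 ]
```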
15 changes: 15 additions & 0 deletions egs/aishell2/s5/cmd.sh
@@ -0,0 +1,15 @@
# you can change cmd.sh depending on what type of queue you are using.
# If you have no queueing system and want to run on a local machine, you
# can change all instances 'queue.pl' to run.pl (but be careful and run
# commands one by one: most recipes will exhaust the memory on your
# machine). queue.pl works with GridEngine (qsub). slurm.pl works
# with slurm. Different queues are configured differently, with different
# queue names and different ways of specifying things like memory;
# to account for these differences you can create and edit the file
# conf/queue.conf to match your queue's configuration. Search for
# conf/queue.conf in http://kaldi-asr.org/doc/queue.html for more information,
# or search for the string 'default_config' in utils/queue.pl or utils/slurm.pl.

export train_cmd="queue.pl --mem 2G"
export decode_cmd="queue.pl --mem 4G"
export mkgraph_cmd="queue.pl --mem 8G"
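
The comment block in this file suggests replacing `queue.pl` with `run.pl` when no queueing system is available. A minimal sketch of that switch follows, assuming that the presence of `qsub` on `PATH` is a good enough signal that GridEngine is usable; that detection heuristic is an assumption and not something the recipe does.

```shell
# Hypothetical variant of cmd.sh: fall back to run.pl when qsub is absent.
if command -v qsub >/dev/null 2>&1; then
  export train_cmd="queue.pl --mem 2G"
  export decode_cmd="queue.pl --mem 4G"
  export mkgraph_cmd="queue.pl --mem 8G"
else
  # run.pl executes jobs on the local machine; the original comment warns
  # that most recipes can exhaust local memory, so run steps one by one.
  export train_cmd="run.pl"
  export decode_cmd="run.pl"
  export mkgraph_cmd="run.pl"
fi
echo "train_cmd=$train_cmd"
```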
3 changes: 3 additions & 0 deletions egs/aishell2/s5/conf/decode.conf
@@ -0,0 +1,3 @@
beam=11.0 # beam for decoding. Was 13.0 in the scripts.
first_beam=8.0 # beam for 1st-pass decoding in SAT.

2 changes: 2 additions & 0 deletions egs/aishell2/s5/conf/mfcc.conf
@@ -0,0 +1,2 @@
--use-energy=false
--sample-frequency=16000
10 changes: 10 additions & 0 deletions egs/aishell2/s5/conf/mfcc_hires.conf
@@ -0,0 +1,10 @@
# config for high-resolution MFCC features, intended for neural network training.
# Note: we keep all cepstra, so it has the same info as filterbank features,
# but MFCC is more easily compressible (because less correlated) which is why
# we prefer this method.
--use-energy=false # use average of log energy, not energy.
--sample-frequency=16000 # AISHELL-2 is sampled at 16kHz
--num-mel-bins=40 # similar to Google's setup.
--num-ceps=40 # there is no dimensionality reduction.
--low-freq=20 # low cutoff frequency for mel bins
--high-freq=-400 # high cutoff frequency, relative to Nyquist of 8000 (=7600)
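
The `--high-freq=-400` setting is an offset from the Nyquist frequency rather than an absolute value, as the comment on that line indicates. The arithmetic can be sketched as follows; the variable names are illustrative, and the rule that non-positive values are treated as offsets is inferred from that comment.

```shell
sample_frequency=16000   # from --sample-frequency
high_freq=-400           # from --high-freq
nyquist=$((sample_frequency / 2))
if [ "$high_freq" -le 0 ]; then
  # Non-positive values are interpreted as an offset below Nyquist.
  effective_high_freq=$((nyquist + high_freq))
else
  effective_high_freq=$high_freq
fi
echo "nyquist=$nyquist effective_high_freq=$effective_high_freq"
# prints: nyquist=8000 effective_high_freq=7600
```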
1 change: 1 addition & 0 deletions egs/aishell2/s5/conf/pitch.conf
@@ -0,0 +1 @@
--sample-frequency=16000
80 changes: 80 additions & 0 deletions egs/aishell2/s5/local/chain/compare_wer.sh
@@ -0,0 +1,80 @@
#!/bin/bash

# Copyright 2018 Emotech LTD (Author: Xuechen LIU)
# Apache 2.0

# compare wer between diff. models in aishell2 chain directory

set -e
. ./cmd.sh
. ./path.sh

if [ $# == 0 ]; then
echo "Usage: $0: <dir1> [<dir2> ... ]"
echo "e.g.: $0 exp/chain/tdnn_7h_sp"
exit 1
fi

echo "# $0 $*"

set_names() {
if [ $# != 1 ]; then
echo "$0: internal error (set_names expects exactly one argument)"
exit 1 # exit the program
fi
dirname=$(echo $1 | cut -d: -f1)
}

# print model names
echo -n "# Model "
for x in $*; do
printf "% 10s" " $(basename $x)"
done
echo

# print number of parameters
echo -n "# Num. of params "
for x in $*; do
set_names $x
params=$(steps/info/chain_dir_info.pl "$x" | grep -o 'num-params=[0-9]*\.[0-9]*M' | cut -d'=' -f2-)
printf "% 10s" $params
done
echo

# print decode WER results
echo -n "# WER(%) "
for x in $*; do
set_names $x
wer=$([ -d $x ] && grep WER $x/decode_test/cer_* | utils/best_wer.sh | awk '{print $2}')
printf "% 10s" $wer
done
echo

# print final log prob for train & validation
echo -n "# Final train prob "
for x in $*; do
prob=$(grep Overall $x/log/compute_prob_train.final.log | grep -v xent | awk '{printf($8)}' | cut -c1-7)
printf "% 10s" $prob
done
echo

echo -n "# Final valid prob "
for x in $*; do
prob=$(grep Overall $x/log/compute_prob_valid.final.log | grep -v xent | awk '{printf($8)}' | cut -c1-7)
printf "% 10s" $prob
done
echo

# do the same for xent objective
echo -n "# Final train prob (xent)"
for x in $*; do
prob=$(grep Overall $x/log/compute_prob_train.final.log | grep -w xent | awk '{printf("%.4f", $8)}')
printf "% 10s" $prob
done
echo

echo -n "# Final valid prob (xent)"
for x in $*; do
prob=$(grep Overall $x/log/compute_prob_valid.final.log | grep -w xent | awk '{printf("%.4f", $8)}')
printf "% 10s" $prob
done
echo
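
The WER row of this script pipes `grep` output through `utils/best_wer.sh` to keep the lowest-error line among the `cer_*` files of one decode directory. For readers without a Kaldi checkout, the selection step can be approximated with `sort`. This is a stand-in sketch, not the implementation of `best_wer.sh`, and it reuses the four `%WER` lines from `egs/aishell2/s5/RESULTS` purely as sample input.

```shell
# Sample input: the %WER lines from egs/aishell2/s5/RESULTS.
results='%WER 44.78 [ 22176 / 49527, 370 ins, 2179 del, 19627 sub ] exp/mono/decode_test/cer_9_0.0
%WER 24.78 [ 12271 / 49527, 394 ins, 815 del, 11062 sub ] exp/tri1/decode_test/cer_11_0.0
%WER 22.54 [ 11165 / 49527, 390 ins, 665 del, 10110 sub ] exp/tri2/decode_test/cer_11_0.0
%WER 19.78 [ 9795 / 49527, 313 ins, 684 del, 8798 sub ] exp/tri3/decode_test/cer_13_0.0'

# Sort numerically on the second field (the error rate) and keep the first line.
# LC_ALL=C keeps the decimal point interpretation independent of the locale.
best_line=$(printf '%s\n' "$results" | LC_ALL=C sort -k2,2n | head -n 1)
best_wer=$(printf '%s\n' "$best_line" | awk '{print $2}')
echo "$best_wer"
# prints: 19.78
```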
1 change: 1 addition & 0 deletions egs/aishell2/s5/local/chain/run_tdnn.sh