madcat arabic: clean scripts, tuning, rescoring, text localization #2716
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -21,18 +21,16 @@ reporting_email= | |
| # chain options | ||
| train_stage=-10 | ||
| xent_regularize=0.1 | ||
| frame_subsampling_factor=4 | ||
| # training chunk-options | ||
| chunk_width=340,300,200,100 | ||
| num_leaves=500 | ||
| # we don't need extra left/right context for TDNN systems. | ||
| chunk_left_context=0 | ||
| chunk_right_context=0 | ||
| tdnn_dim=450 | ||
| # training options | ||
| srand=0 | ||
| remove_egs=false | ||
| lang_test=lang_test | ||
| lang_decode=data/lang | ||
| lang_rescore=data/lang_rescore_6g | ||
| # End configuration section. | ||
| echo "$0 $@" # Print the command line for logging | ||
|
|
||
|
|
@@ -168,13 +166,13 @@ if [ $stage -le 5 ]; then | |
| --chain.leaky-hmm-coefficient=0.1 \ | ||
| --chain.l2-regularize=0.00005 \ | ||
| --chain.apply-deriv-weights=false \ | ||
| --chain.lm-opts="--num-extra-lm-states=500" \ | ||
| --chain.frame-subsampling-factor=$frame_subsampling_factor \ | ||
| --chain.alignment-subsampling-factor=$frame_subsampling_factor \ | ||
| --chain.lm-opts="--ngram-order=2 --no-prune-ngram-order=1 --num-extra-lm-states=1000" \ | ||
|
Contributor
do you know how much does
Contributor
Author
Previously, I tried running run_e2e_cnn_1a.sh once with --ngram-order=2 --no-prune-ngram-order=1 and once with --num-extra-lm-states=500, but the results were the same for madcat arabic: 7.81 vs 7.82 WER. It was more helpful in the Tamil OCR setup, where it gave an absolute WER improvement of around 0.5%. |
||
| --chain.frame-subsampling-factor=4 \ | ||
| --chain.alignment-subsampling-factor=4 \ | ||
| --trainer.srand=$srand \ | ||
| --trainer.max-param-change=2.0 \ | ||
| --trainer.num-epochs=4 \ | ||
| --trainer.frames-per-iter=1000000 \ | ||
| --trainer.frames-per-iter=2000000 \ | ||
| --trainer.optimization.num-jobs-initial=3 \ | ||
| --trainer.optimization.num-jobs-final=16 \ | ||
| --trainer.optimization.initial-effective-lrate=0.001 \ | ||
|
|
@@ -183,10 +181,6 @@ if [ $stage -le 5 ]; then | |
| --trainer.num-chunk-per-minibatch=64,32 \ | ||
| --trainer.optimization.momentum=0.0 \ | ||
| --egs.chunk-width=$chunk_width \ | ||
| --egs.chunk-left-context=$chunk_left_context \ | ||
| --egs.chunk-right-context=$chunk_right_context \ | ||
| --egs.chunk-left-context-initial=0 \ | ||
| --egs.chunk-right-context-final=0 \ | ||
| --egs.dir="$common_egs_dir" \ | ||
| --egs.opts="--frames-overlap-per-eg 0" \ | ||
| --cleanup.remove-egs=$remove_egs \ | ||
|
|
@@ -207,18 +201,20 @@ if [ $stage -le 6 ]; then | |
| # as long as phones.txt was compatible. | ||
|
|
||
| utils/mkgraph.sh \ | ||
| --self-loop-scale 1.0 data/$lang_test \ | ||
| --self-loop-scale 1.0 $lang_decode \ | ||
| $dir $dir/graph || exit 1; | ||
| fi | ||
|
|
||
| if [ $stage -le 7 ]; then | ||
| frames_per_chunk=$(echo $chunk_width | cut -d, -f1) | ||
| steps/nnet3/decode.sh --acwt 1.0 --post-decode-acwt 10.0 \ | ||
| --extra-left-context $chunk_left_context \ | ||
| --extra-right-context $chunk_right_context \ | ||
| --extra-left-context-initial 0 \ | ||
| --extra-right-context-final 0 \ | ||
| --frames-per-chunk $frames_per_chunk \ | ||
| --nj $nj --cmd "$cmd" \ | ||
| $dir/graph data/test $dir/decode_test || exit 1; | ||
|
|
||
| steps/lmrescore_const_arpa.sh --cmd "$cmd" $lang_decode $lang_rescore \ | ||
| data/test $dir/decode_test{,_rescored} || exit 1 | ||
| fi | ||
|
|
||
| echo "Done. Date: $(date). Results:" | ||
| local/chain/compare_wer.sh $dir | ||
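
As a small illustration of the decode stage above: `frames_per_chunk` is taken from the first comma-separated value of `chunk_width`. The sketch below reuses the default `chunk_width` from the configuration section; the `frames_per_chunk_alt` variable and the parameter-expansion variant are shown only for comparison and are not part of this PR.

```shell
# Same default as in the configuration section of the diff.
chunk_width=340,300,200,100

# Stage 7 takes the first comma-separated value as the decoding chunk size.
frames_per_chunk=$(echo $chunk_width | cut -d, -f1)

# Hypothetical alternative (not in the PR): POSIX parameter expansion that
# strips the longest suffix beginning at the first comma.
frames_per_chunk_alt=${chunk_width%%,*}

echo "frames_per_chunk=$frames_per_chunk alt=$frames_per_chunk_alt"

# Note: in bash, $dir/decode_test{,_rescored} in the rescoring call expands
# to two arguments: $dir/decode_test and $dir/decode_test_rescored.
```

Both forms give the same value here; the `cut` version matches what the script already does, so there is no need to change it.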
It's not clear to me why the aug_setting variable is used here; I think it would be clearer if you directly used 'normal', etc.