Conversation


@dresen dresen commented Dec 2, 2016

No description provided.

binghe and others added 30 commits November 10, 2016 19:28
* Fixed OpenFst building on Windows/cygwin64: OSTYPE doesn't exist there, so check OS = Windows_NT instead.

* The old OSTYPE branch is kept for safety.
Change Travis build to use shared libraries to avoid 'no space left on device' (#1183)

* Change Travis build to use shared libraries to avoid 'no space left on device' error

* Change instructions to make --shared the default (faster compile)
Added a reverberation-based data augmentation recipe for AMI. Gives gains in IHM, SDM and MDM settings. (TDNN + chain recipe checked in.)
This commit changes the way gradient clipping is done in LSTMs and BLSTMs to be a bit more similar to "truncated BPTT", where we zero out the gradient at the edges of blocks (default block size: 30).  In fact, we only do this if the gradient is above a certain size (3.0 by default).  As before, on all frames, we clip gradients that are too large (default threshold: 30.0).
This improves results slightly (or leaves them the same) and seems to be helpful in controlling instability that we used to occasionally see in BLSTM training.
Caution: the default options of the 'make_configs' scripts have changed, so if you rerun an old BLSTM setup from the config generation stage it will not be quite the same.
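A minimal sketch of the per-frame rule described above, written in NumPy rather than Kaldi's actual C++ (the function name and array layout are assumptions; the block size and thresholds are the defaults quoted in the commit message):

```python
import numpy as np

def truncate_and_clip(grads, block_size=30, zeroing_threshold=3.0,
                      clipping_threshold=30.0):
    """grads: array of shape (num_frames, dim), one gradient row per frame."""
    out = grads.copy()
    for t in range(out.shape[0]):
        norm = np.linalg.norm(out[t])
        if t % block_size == 0 and norm > zeroing_threshold:
            # At block edges, zero the gradient entirely if it is already
            # large -- the "truncated BPTT"-like behaviour.
            out[t] = 0.0
        elif norm > clipping_threshold:
            # On all frames, rescale gradients whose norm exceeds the
            # clipping threshold, as before.
            out[t] *= clipping_threshold / norm
    return out
```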
Fix arpa2fst to allow trailing whitespace in the headers
Fix nnet3 endpointing to correctly use frame subsampling factor (#1184)
… for computation in nnet3 setup. For more details see Issue #1190 (#1194)
Fix an asymmetry in how the derivatives were truncated outside the chunk for BLSTM training.  [Caution: may change BLSTM results.]

In the nnet3 training code there is a mechanism --{min,max}-deriv-time to stop processing derivatives outside a particular time range, which can be used to avoid wasteful and possibly harmful computation in, e.g., the +-40-frame context outside the chunk boundaries where the supervision lies. [E.g. the gradients may blow up there.] Due to an oversight, this was previously only applied on the left, i.e. the python script set --min-deriv-time but not --max-deriv-time. This commit fixes that, and also tunes the time values used in the scripts, to limit the derivatives to +-10 frames around the supervised chunk.

Results for BLSTM training are improved where tested.  Caution: if you are tuning BLSTM things, you may need to re-run baselines after you merge this change.
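Conceptually, the --{min,max}-deriv-time mechanism restricts which frames contribute derivatives. The rough NumPy sketch below only illustrates the masking idea (names and layout are assumptions); the real nnet3 computation simply never evaluates the derivatives outside the range, which is where the savings come from:

```python
import numpy as np

def mask_derivs(derivs, frame_times, min_deriv_time, max_deriv_time):
    """derivs: (num_frames, dim); frame_times: (num_frames,) time index per row."""
    keep = (frame_times >= min_deriv_time) & (frame_times <= max_deriv_time)
    # Derivatives outside [min_deriv_time, max_deriv_time] do not contribute.
    return derivs * keep[:, None]
```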
Added (B)LSTM scripts for ami/s5b and tedlium/s5_r2
danpovey and others added 29 commits November 21, 2016 17:24
Modify TransitionModel for more compact chain-model graphs
…odel]; also, cosmetic fix to steps/nnet3/chain/train.py
1. Added a TDNN+LSTM recipe which performs similarly to the BLSTM model with significantly smaller latency (21 frames vs. 51 frames).
2. Added BLSTM results in the xconfig setup, without layer-wise discriminative pre-training (2.7% rel. improvement).
3. Added an example TDNN recipe which uses a subset of the feature vector from neighboring time steps (results pending).

xconfig: added a tdnn layer which can handle the subset-dim option.
… (atomicAdd not supported there, needed for chain models).
…ow ';' as a word when those scripts are used. Bug fix in egs/wsj/s5/local/run_segmentation.sh.
…th zero learning rates, backprop does not have to be done.
…n is slow if using an older-style LM-formatting script. Do this by disabling a recently introduced optimization if the disambig-symbol is not specified.
Fix bugs for DOUBLE_PRECISION = 1
move propagate of norm component to cu math

move to cu math

fix cu math bug

CuMatrix::NormalizePerRow<float>,    16   0.015   0.001  13.16x
CuMatrix::NormalizePerRow<float>,    32   0.062   0.005  13.54x
CuMatrix::NormalizePerRow<float>,    64   0.239   0.019  12.77x
CuMatrix::NormalizePerRow<float>,   128   0.748   0.074  10.16x
CuMatrix::NormalizePerRow<float>,   256   2.255   0.289  7.79x
CuMatrix::NormalizePerRow<float>,   512   5.399   1.001  5.39x
CuMatrix::NormalizePerRow<float>,  1024  10.010   2.731  3.67x
CuMatrix::NormalizePerRow<double>,    16   0.015   0.001  12.45x
CuMatrix::NormalizePerRow<double>,    32   0.059   0.005  12.69x
CuMatrix::NormalizePerRow<double>,    64   0.236   0.018  12.81x
CuMatrix::NormalizePerRow<double>,   128   0.701   0.072  9.78x
CuMatrix::NormalizePerRow<double>,   256   1.738   0.279  6.23x
CuMatrix::NormalizePerRow<double>,   512   4.415   0.903  4.89x
CuMatrix::NormalizePerRow<double>,  1024   7.392   2.154  3.43x

fix small bug.

strictly follow the original impl.

fix kernel bug

add comment to the cuda kernel function
speed test for normalize per row

correctness test for normalize per row

move test to cu math test

fix test bug
New CUDA kernel for NormalizeComponent::propagate
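For reference, the per-row normalization the new kernel accelerates amounts to scaling each row to a fixed root-mean-square value. A NumPy sketch under that reading follows (the target_rms default and epsilon floor are assumptions, and this is not the CUDA kernel itself):

```python
import numpy as np

def normalize_per_row(x, target_rms=1.0, eps=1e-20):
    """Scale each row of x so its root-mean-square value equals target_rms."""
    dim = x.shape[1]
    row_norm = np.sqrt(np.maximum((x * x).sum(axis=1, keepdims=True), eps))
    # Each output row y satisfies ||y||^2 == dim * target_rms^2 (up to the eps floor).
    return x * (target_rms * np.sqrt(dim) / row_norm)
```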
Look in right location for new style subdirectories
* This commit modifies the tedlium s5_r2 setup to use the original LM from TedliumRelease2 (instead of an LM built from the cantab-tedlium data). (#1224)

* Changed the definition of the deriv-truncate-margin option. Now the margin is defined on top of the model contexts, which applies to more general network architectures. Changed the values used in the scripts from 10 to 8 to compensate for the script change, so we don't have to rerun experiments.
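As a rough illustration only (the exact formula is an assumption, not taken from this PR): if a model's left context were 40 frames, then under the new definition --deriv-truncate-margin=8 would permit derivatives up to about 40 + 8 = 48 frames to the left of the supervised chunk, whereas the old definition used the margin as a fixed frame count around the chunk regardless of the model's context.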
…de crash; compile problem on nvcc 8.0; fix thread-sync errors. (#1228)

This fixes a synchronization problem introduced by PR #1217 (merged yesterday) that can cause crashes in TDNN training.
swbd: added results for the TDNN recipe which uses the subset-dim option
…ailover to openslr faster (server is often down).
* This pull request implements an automatic method of finding likely bugs in a lexicon, and providing suggested fixes.  Useful if your lexicon is incomplete or contains errors.
@dresen dresen merged commit e840588 into dresen:master Dec 2, 2016