Merge master #85

naxingyu · 2019-12-17T07:44:55Z

build and test pass with minor modifications to pybind/Makefile

… for ASR and speaker ID (kaldi-asr#3119) Now multi-style training with noise and reverberation is an option (instead of speed augmentation). Multi-style training seems to be more robust to unseen/noisy conditions.

…d scripts (kaldi-asr#3320)

With a segments file constructed from exact wave file durations, some segments came out one sample short. The reason is that multiplying the float sample frequency by the double audio time point is inexact. For example, float 8000.0 multiplied by double 2.03 yields 16239.99999999999, one LSB short of the correct sample number 16240. Also changed all endpoint calculations so that they are performed in seconds, not sample numbers, as this avoids a conversion in nearly every comparison. Positions in diagnostic messages are now also reported in seconds, not sample numbers.
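The rounding problem described above can be reproduced in a few lines. This is only an illustrative sketch, not the code from the commit: the helper names `samples_truncated` and `samples_rounded` are hypothetical, and Python floats stand in for the C++ float/double mix (8000.0 is exactly representable in both).

```python
import math


def samples_truncated(sample_rate: float, seconds: float) -> int:
    """Convert a time point to a sample index by truncating the product."""
    return int(sample_rate * seconds)


def samples_rounded(sample_rate: float, seconds: float) -> int:
    """Convert a time point to a sample index by rounding to nearest."""
    return int(math.floor(sample_rate * seconds + 0.5))


# 2.03 has no exact binary representation, so the product lands just
# below the mathematically exact value 16240.
product = 8000.0 * 2.03
print(product < 16240.0)
print(samples_truncated(8000.0, 2.03))
print(samples_rounded(8000.0, 2.03))
```

Truncation loses the last sample, while rounding recovers it. The commit message itself describes a different remedy, namely doing the endpoint comparisons in seconds.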

…ures (kaldi-asr#3316) Generate utt2dur and utt2num_frames during feature extraction, and store frame period in frame_shift file in feature directory. Copy relevant .conf files used in feature extraction into the conf/ subdirectory with features. Add missing validations and options in some extraction scripts.

…aldi-asr#3322)

Getting utt2dur involves accessing wave files, and potentially running full pipelines in wav.scp, which may take hours for a large data set. If utt2num_frames exists, use it instead, provided the frame rate is known. Issue: kaldi-asr#3303 Fixes: kaldi-asr#3297 "cat: broken pipe"
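A minimal sketch of the idea, assuming utt2num_frames holds one `<utt-id> <num-frames>` pair per line and the frame shift is given in seconds. The function name `utt2dur_from_num_frames` is hypothetical, and the result is an approximation that ignores the analysis window length and edge effects; it is not the script changed in this commit.

```python
def utt2dur_from_num_frames(utt2num_frames_lines, frame_shift):
    """Estimate utterance durations (seconds) from frame counts.

    utt2num_frames_lines: iterable of "<utt-id> <num-frames>" strings.
    frame_shift: frame period in seconds, e.g. 0.01.
    """
    utt2dur = {}
    for line in utt2num_frames_lines:
        parts = line.split()
        if len(parts) != 2:
            raise ValueError("bad utt2num_frames line: %r" % line)
        utt, num_frames = parts
        # Approximate duration: number of frames times the frame period.
        utt2dur[utt] = int(num_frames) * frame_shift
    return utt2dur


durations = utt2dur_from_num_frames(["utt1 100", "utt2 250"], 0.01)
for utt, dur in sorted(durations.items()):
    print(utt, round(dur, 2))
```

No wave file is opened here, which is the point of the change: the estimate costs one pass over a small text file.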

Fixes typo in kaldi-asr#3119

) subset_data_dir.sh has been refactored thoroughly so that its logic can be followed more easily. It has been well tested and dogfooded. All changes here are necessary to subset, combine and verify utt2num_frames, and copy frame_shift to new directories where necessary.

…aldi-asr#3315) Relevant discussion: https://groups.google.com/forum/#!topic/kaldi-help/2uxfByEAmfw

…extract-segments (kaldi-asr#3331)

… fix for kaldi-asr#3119 (kaldi-asr#3334)

…aldi-asr#3335)

…kaldi-asr#3311) This avoids a ping-pong of memory to the host. The implementation now assumes device memory. Interfaces will allocate device memory and copy to it if the data starts on the host. Add a CUDA matrix copy function which clamps rows. This is much faster than copying one row at a time, and the kernel can handle the clamping for free.
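The row-clamping copy can be pictured with a small CPU-side sketch. This is plain Python on nested lists, not the CUDA kernel from the commit, and `copy_rows_clamped` with its parameters is a hypothetical name chosen for illustration: each requested row index is clamped into a valid range, so out-of-range requests repeat the first or last row.

```python
def copy_rows_clamped(src, start_row, end_row, clamp_low, clamp_high):
    """Copy rows [start_row, end_row) of src, clamping each row index
    into [clamp_low, clamp_high] before reading."""
    out = []
    for r in range(start_row, end_row):
        clamped = min(max(r, clamp_low), clamp_high)
        out.append(list(src[clamped]))
    return out


src = [[0, 0], [1, 1], [2, 2]]
# Request two rows of left context and two of right context beyond the data.
padded = copy_rows_clamped(src, -2, 5, 0, len(src) - 1)
print(padded)
```

On a GPU the same index arithmetic would run once per output element inside a single kernel, which is why the clamping adds essentially no cost compared with issuing one copy per row.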

…an Tobler (kaldi-asr#3347)

…ons (kaldi-asr#3341)

…i-asr#3360)

* Add CUDA accelerated MFCC computation. Creates a new directory 'cudafeat' for placing CUDA feature extraction components as they are developed. Added a directory 'cudafeatbin' for placing binaries that are CUDA accelerated and that mirror binaries elsewhere. This commit implements: feature-window-cuda.h/cu, which implements a feature window on the device by copying it from a host feature window; feature-mfcc-cuda.h/cu, which implements the CUDA MFCC feature extractor; and compute-mfcc-feats-cuda.cc, which mirrors compute-mfcc-feats.cc. There were also minor changes to other files.
* Only build CUDA binaries if CUDA is enabled

…ldi-asr#3351) Small CUDA memory copies are inefficient because each copy can add multiple microseconds of latency. The code as written would copy small matrices or vectors to and from the tasks one after another. To avoid this, I've implemented a batched matrix copy routine. It takes arrays of matrix descriptions for the input and output and batches the copies into a single kernel call. This is used in both FormatInputs and FormatOutputs to reduce launch latency overhead. The kernel for the batched copy uses a trick to avoid a memory copy of the host parameters. The parameters are put into a struct containing a statically sized array. These parameters are then marshalled like normal CUDA kernel parameters. This avoids additional launch latency overhead. There is still more work to do at the beginning and end of nnet3. In particular, we may want to batch the clamped memory copies and the large number of D2D copies at the end. I haven't fully tracked those down and may return to them in the future.
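The batching idea can be illustrated without a GPU. In the sketch below, each call to `launch_batched_copy` stands in for one kernel launch whose parameters are a fixed-capacity array of copy descriptions. All names and the capacity `MAX_BATCH = 4` are hypothetical, and this is Python on lists rather than the CUDA code from the commit.

```python
MAX_BATCH = 4  # hypothetical capacity of the statically sized parameter array


def launch_batched_copy(batch):
    """Stand-in for one kernel launch that performs every copy in batch."""
    assert len(batch) <= MAX_BATCH
    for src, dst in batch:
        dst[:] = src  # copy contents into the existing destination buffer


def batched_copy(pairs):
    """Copy all (src, dst) pairs, using as few launches as possible.

    Returns the number of launches issued."""
    launches = 0
    for i in range(0, len(pairs), MAX_BATCH):
        launch_batched_copy(pairs[i:i + MAX_BATCH])
        launches += 1
    return launches


srcs = [[i, i + 1] for i in range(10)]
dsts = [[0, 0] for _ in range(10)]
num_launches = batched_copy(list(zip(srcs, dsts)))
print(num_launches, dsts == srcs)
```

Ten copies issued one at a time would pay the launch latency ten times. Grouped by a fixed capacity, the same work takes three launches in this toy setting.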

…aldi-asr#3326)

…r#3358)
- end the training when there is no more data to refill one of the streams,
- this avoids overtraining to the 'last' utterance,

…itch (kaldi-asr#3727)" (kaldi-asr#3728) This reverts commit 59255ae.

* make nvcc+msvc happier when two-phase name lookup is involved. NOTE: in the simple native case (CPU, without CUDA), msvc is happy with TemplatedBase<T>::base_value, but nvcc is not...
* position of __restrict__ in msvc is restricted

…ed build (kaldi-asr#3580)

… present, to balance splits

…t2dur if present, to balance splits" (kaldi-asr#3746) This reverts commit 1d0b267.

…asr#3699)

…#3752)

…cases (kaldi-asr#3756)

…(missing phones.txt) (kaldi-asr#3757)

…me platforms (kaldi-asr#3759)

* changed scoring tool for diarization
* added comment for scoring
* fixing number of deletions, adding script to check that the DP result of the total errors is equivalent to the sum of the individual errors
* updated RESULTS for new diarization scoring
* outputting WER similar to compute-wer routine
* adding routine to select best LM weight and insertion penalty factor based on the development set
* updating results
* changing lang_chain to lang, minor fix
* adding all array option
* change in directory structure of scoring_kaldi_multispeaker to make it similar to scoring_kaldi
* removing test sets from run ivector script
* added ref RTTM creation
* making modifications for all array
* minor fix

…e extractor (kaldi-asr#3764)

…on) (kaldi-asr#3767)

…sr#3750)

…ng (kaldi-asr#3770)

…those vectors (kaldi-asr#3768)

csukuangfj · 2019-12-17T07:54:36Z

aha, I see.

vimalmanohar and others added 30 commits May 11, 2019 20:37

[egs] New chime-5 recipe (kaldi-asr#2893)

e922333

[egs] updated local/musan.sh to steps/data/make_musan.sh in speaker i…

cec8958

…d scripts (kaldi-asr#3320)

[build] Initial version of Docker images for (CPU and GPU versions) (k…

a2e7ba3

…aldi-asr#3322)

[scripts] fix typo/bug in make_musan.py (kaldi-asr#3327)

91609c7

[scripts] Fixed misnamed variable in data/make_musan.py (kaldi-asr#3324)

95e81c0

[scripts] typo fix in augmentation script (kaldi-asr#3329)

0ff318b

Fixes typo in kaldi-asr#3119

[scripts] Extend combine_ali_dirs.sh to combine alignment lattices (k…

c8b93bc

…aldi-asr#3315) Relevant discussion: https://groups.google.com/forum/#!topic/kaldi-help/2uxfByEAmfw

[src] Fix rare case when segment end rounding overshoots file end in …

528e072

…extract-segments (kaldi-asr#3331)

[scripts] Change --modify-spk-id default to False; back-compatibility…

8397e05

… fix for kaldi-asr#3119 (kaldi-asr#3334)

[build] Add easier configure option in failure message of configure (k…

8b54ef8

…aldi-asr#3335)

[scripts,minor] Fix typo in comment (kaldi-asr#3338)

ce8798b

[src,egs] Add option for applying SVD on trained models (kaldi-asr#3272)

9e0a7f6

[build] Update GCC support check for CUDA toolkit 10.1 (kaldi-asr#3345)

52e7ecf

[egs] Fix to aishell1 v1 download script (kaldi-asr#3344)

29f3c14

[scripts] Support utf-8 files in some scripts (kaldi-asr#3346)

a5dd6bd

[src] Fix potential underflow bug in MFCC, RE energy floor, thx: Zolt…

8c6cd31

…an Tobler (kaldi-asr#3347)

[scripts]: add warning to nnet3/chain/train.py about ineffective opti…

e643c73

…ons (kaldi-asr#3341)

[scripts] Fix regarding UTF handling in cleanup script (kaldi-asr#3352)

8706f06

[scripts] Change encoding to utf-8 in data augmentation scripts (kald…

800924d

…i-asr#3360)

[scripts,minor] Remove outdated comment (kaldi-asr#3361)

16097b4

[egs] A kaldi recipe based on the corpus named "aidatatang_200zh". (k…

ced53e1

…aldi-asr#3326)

[src] nnet1: changing end-rule in 'nnet-train-multistream', (kaldi-as…

f8a4376

…r#3358) - end the training when there is no more data to refill one of the streams, - this avoids overtraining to the 'last' utterance,

danpovey and others added 27 commits November 21, 2019 20:18

Revert "[src] Making ivector extractor tolerate dim mismatch due to p…

eb28a6a

…itch (kaldi-asr#3727)" (kaldi-asr#3728) This reverts commit 59255ae.

[build] Add CMake Build System as alternative to current Makefile-bas…

f88c475

…ed build (kaldi-asr#3580)

[scripts] Modify split_data_dir.sh and split_scp.pl to use utt2dur if…

1d0b267

… present, to balance splits

[scripts] fix slurm.pl error (kaldi-asr#3745)

915bb78

Revert "[scripts] Modify split_data_dir.sh and split_scp.pl to use ut…

666b8cb

…t2dur if present, to balance splits" (kaldi-asr#3746) This reverts commit 1d0b267.

[egs] Children's speech ASR recipe for cmu_kids and cslu_kids (kaldi-…

413c7c8

…asr#3699)

[src] Incremental determinization [cleaned up/rewrite] (kaldi-asr#3737)

1cd7ee9

[scripts] Add scripts to create combine fmllr-tranform dirs(kaldi-asr…

d77457d

…#3752)

[src] CUDA decoder: fix invalid-lattice error that happens in corner …

018d180

…cases (kaldi-asr#3756)

[egs] Add Chime 6 baseline system (kaldi-asr#3755)

be2dbf4

[scripts] Fix issue in copy_lat_dir.sh affecting combine_lat_dirs.sh …

daf9d6e

…(missing phones.txt) (kaldi-asr#3757)

[src] Add missing #include, needed for CUDA decoder compilation on so…

07d02da

…me platforms (kaldi-asr#3759)

[scripts] fix bug in steps/data/reverberate_data_dir.py (kaldi-asr#3762)

6f329a6

[src] CUDA allocator: fix order of next largest block (kaldi-asr#3739)

d0007f3

[src] CUDA decoding: add support for affine transforms to CUDA featur…

42c3888

…e extractor (kaldi-asr#3764)

[src] relax assertion constraint slightly (RE matrix orthonormalizati…

5ca36b9

…on) (kaldi-asr#3767)

[src] CUDA decoder: fix bug in NumPendingTasks() (kaldi-asr#3769)

cba392c

[src] Add options to select specific gpu, reuse cuda context (kaldi-a…

ea5757a

…sr#3750)

[src] Move CheckAndFix to config struct (kaldi-asr#3749)

799e1f0

[egs,scripts] Add recipes for CN-Celeb (kaldi-asr#3758)

e27bbda

[src] CUDA decoder: remove unecessary sync that was added for debuggi…

1be4750

…ng (kaldi-asr#3770)

[src] CUDA decoder: shrink channel vectors instead of vector holding …

fe7b922

…those vectors (kaldi-asr#3768)

Merge remote-tracking branch 'origin/master' into HEAD

9582e71

add include path

255c666

update test

eadcf1f

danpovey merged commit ee0fdfd into danpovey:pybind11 Dec 17, 2019

naxingyu deleted the merge-master branch December 17, 2019 08:07
