v1.16 - Mountain Warming
What's Changed
Recipes
New:
- Add speech translation corpus MuST-C by @csukuangfj in #1079
- Extend LibriTTS recipe to support LibriTTS-R by @pzelasko in #1082
- SURT dataset by @desh2608 in #951
- [Recipe] VoxPopuli by @desh2608 in #1089
- Air Traffic Control (ATC) corpora - various improvements 2 by @rouseabout in #1090
- Add Bengali.AI Speech corpus for Kaggle Research Code Competition by @yfyeung in #1108
- Support AudioMNIST by @csukuangfj in #1093
Improvements:
- Add multithread to peoples_speech by @yfyeung in #1078
- Add multiprocess mechanism for Common Voice by @yfyeung in #1025
- text normalization for aishell4 by @JinZr in #1103
- updated text norm for aishell2 by @JinZr in #1104
- updated text norm for magicdata by @JinZr in #1105
- PR for the KeSpeech recipe by @JinZr in #1106
- Small changes in some existing recipes by @desh2608 in #1110
New features
- `CutSet.from_files` constructor for random order multi-file cutsets by @pzelasko in #1085
- Infinite random-file random-line stateless sampler by @pzelasko in #1102
- `StatelessSampler`: remove TRNG, make `base_seed` a mandatory parameter by @pzelasko in #1109
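
The "random-file random-line" sampling idea with a mandatory seed can be illustrated with a minimal pure-Python sketch. This is not Lhotse's implementation: the function name `sample_lines_forever` is hypothetical, it reads plain text files rather than cut manifests, and it ignores distributed workers. It only shows why an explicit `base_seed` makes an infinite random stream reproducible.

```python
import random
from pathlib import Path
from typing import Iterator, List, Sequence


def sample_lines_forever(paths: Sequence[Path], base_seed: int) -> Iterator[str]:
    """Yield lines forever: pick a random file, then a random line from it.

    Hypothetical helper for illustration only. No position is tracked
    between draws; the only input besides the files is the explicit seed.
    """
    rng = random.Random(base_seed)

    # Load every file once and skip empty ones so rng.choice never fails.
    files: List[List[str]] = []
    for path in paths:
        lines = Path(path).read_text(encoding="utf-8").splitlines()
        if lines:
            files.append(lines)
    if not files:
        raise ValueError("no non-empty files to sample from")

    while True:
        # First draw selects the file, second draw selects the line.
        yield rng.choice(rng.choice(files))
```

Because the generator never terminates, a caller would bound it explicitly, for example with `itertools.islice(sample_lines_forever(paths, base_seed=0), 100)`. Two generators built with the same files and the same `base_seed` yield the same sequence.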
Other improvements
- Loudness fix by @desh2608 in #1087
- Update SpecAug custom state_dict to be compatible with PyTorch by @osadj in #1091
- Adding log spectrogram by @Tomiinek in #1094
- add user defined kaldi feature type by @ZihanLiao in #1101
- Update cuts.rst by @taras-sereda in #1112
- Add alignment scores from CTM by @desh2608 in #1116
- Fix and enhance TIMIT recipe by @flyingleafe in #1072
- Fixing shar reader assert by @Tomiinek in #1117
- Support `del (cut|supervision).custom_field` by @pzelasko in #1113
- Exposing `tolerance` for matching supervisions with features in Kaldi imported data dirs by @pzelasko in #1119
- Some additional options for supervision-related methods by @desh2608 in #1115
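
To make the `del (cut|supervision).custom_field` item concrete, here is a rough sketch of how attribute deletion can be routed to a dictionary of custom fields. The `ToyCut` class is a hypothetical stand-in written for this example, not Lhotse's `Cut` or `SupervisionSegment`; it only assumes that custom fields live in a `custom` dict exposed through attribute access.

```python
from dataclasses import dataclass
from typing import Any, Dict, Optional


@dataclass
class ToyCut:
    """Hypothetical stand-in for an object carrying user-defined fields."""

    id: str
    custom: Optional[Dict[str, Any]] = None

    def __getattr__(self, name: str) -> Any:
        # Called only when normal lookup fails: fall back to the custom dict.
        custom = self.__dict__.get("custom")
        if custom is not None and name in custom:
            return custom[name]
        raise AttributeError(name)

    def __setattr__(self, name: str, value: Any) -> None:
        # Declared dataclass fields are stored normally.
        if name in self.__dataclass_fields__:
            super().__setattr__(name, value)
            return
        # Anything else is treated as a custom field.
        if self.custom is None:
            super().__setattr__("custom", {})
        self.custom[name] = value

    def __delattr__(self, name: str) -> None:
        # Declared fields keep the default deletion behavior.
        if name in self.__dataclass_fields__:
            super().__delattr__(name)
            return
        custom = self.__dict__.get("custom")
        if custom is None or name not in custom:
            raise AttributeError(name)
        del custom[name]
```

With this toy class, `cut.snr = 12.5` stores the value under `cut.custom["snr"]`, and `del cut.snr` removes it again, after which reading `cut.snr` raises `AttributeError`.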
New Contributors
- @osadj made their first contribution in #1091
- @ZihanLiao made their first contribution in #1101
- @taras-sereda made their first contribution in #1112
Full Changelog: v1.15...v1.16