v1.18 - The Ice Age
What's Changed
New features
- MMS forced alignment backend by @flyingleafe in #1185
- Two new options:
CutSet.from_shar(seed="trng")
andDynamicCutSampler(quadratic_duration=...)
by @pzelasko in #1199 - Faster initialization option in
DynamicBucketingSampler
+ various fixes by @pzelasko in #1210 - CLI to estimate and print bucket bins for a cut set by @pzelasko in #1214
- More flexible setting of audio backends by @pzelasko in #1219
Recipes
- Add recipe for Medical Corpus by @yfyeung in #1212
- minor fix for the AMI recipe by @JinZr in #1178
- fixes compatibility with Edin. ver. VCTK dataset by @JinZr in #1182
- Minor bug fix for eval2000 recipe by @JinZr in #1127
- support far field data for icmcasr challenge by @yuekaizhang in #1189
- fixed text norm for
tal_csasr
by @JinZr in #1198 #1213
Other improvements
MixedCut.truncate
: fix the case when onlyPaddingCut
s are left by @flyingleafe in #1157- Fix some potential problems in OPUS file reading by @yangb05 in #1181
- fix an issue where 404 exception leaves 0 byte placeholder by @JinZr in #1190
- Prevent accidental renaming when using with_suffix by @chiiyeh in #1192
- Fix shar export for
num_jobs>1
and recordings with transforms by @pzelasko in #1196 - fix speaker error by @yzmyyff in #1197
- Fix for
trim_to_alignments
issue by @desh2608 in #1193 - Add
deterministic_rng
to more flaky tests by @pzelasko in #1200 - update_recipes by @vesis84 in #1208
- SpeechSynthesisDataset returns
speaker_ids
by @JinZr in #1206 - Fix audio backend selection by @pzelasko in #1216
- save sdm files into a single mdm file to do gss by @yuekaizhang in #1221
- Modify SpeechSynthesisDataset class, make it return text by @yaozengwei in #1205
- Allow lhotse installation without torchaudio for a limited set of features by @pzelasko in #1231
- Use
attacut
module for Thai word tokenization (in MMS forced alignment) by @flyingleafe in #1232
New Contributors
- @yangb05 made their first contribution in #1181
- @chiiyeh made their first contribution in #1192
- @yzmyyff made their first contribution in #1197
- @yaozengwei made their first contribution in #1205
Full Changelog: v1.17...v1.18