v1.20 - Pining for the Fjords
What's Changed
New features
- Extended the subset of lhotse that works without installing torchaudio by @pzelasko in #1253 #1255
- Ensure
drop_last=False
always returns an equal number of mini-batches by re-distributing and/or duplicating some data by @pzelasko in #1277 - Improved CPU memory usage and shuffling + bucketing in
DynamicBucketingSampler
by @pzelasko in #1276 - Enable seed randomization in dynamic samplers by @pzelasko in #1278
Recipes
- Fluent Speech Commands dataset, SLU task by @HSTEHSTEHSTE in #1272
Other improvements
- Update docs with env vars used by Lhotse by @pzelasko in #1252
- support whisper large v3; deepspeed launcher rank world_size setting by @yuekaizhang in #1260
- Fix non-deterministic tests by @pzelasko in #1261
- Fix duplication issues in CutSet.mix() by @pzelasko in #1268
- Support controllable
CutSet.mux
weights in multiprocess dataloading by @pzelasko in #1266 - Fix distributed sampler initialization and
exceeded
sampler warning false positives by @pzelasko in #1270 - Install kaldi-native-io explicitly in the kaldi doc example. by @csukuangfj in #1275
- Allow duplicate cut IDs in a CutSet (CutSet is list-like instead of dict-like) by @pzelasko in #1279
New Contributors
- @HSTEHSTEHSTE made their first contribution in #1272
Full Changelog: v1.19...v1.20