Releases: lhotse-speech/lhotse

v1.21 - Glaciology

13 Feb 19:57
769c273

What's Changed

This release patches lhotse to handle cases where libsox is not available for torchaudio. The audio backend code went through an additional round of refactoring, and libsndfile is now the preferred default backend because it decoded audio faster in our testing. Going forward, when LHOTSE_AUDIO_BACKEND is set, the same backend is used for loading audio, saving audio, and reading audio metadata (where possible). This release also adds support for Python 3.12 and PyTorch 2.2.

  • Add VAD to Supervisions in LibriLight Recipe by @yfyeung in #1280
  • Fixes for manifest validation and fixing by @pzelasko in #1284
  • Handle error with cachedir creation gracefully by @pzelasko in #1287
  • AudioBackend specific save_audio and info, managing missing SoX in torchaudio, Python 3.12 / PyTorch 2.2 support, using libsndfile as preferred audio backend by @pzelasko in #1288
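
As a minimal sketch of using the LHOTSE_AUDIO_BACKEND variable mentioned above from Python: the helper `configure_audio_backend` is hypothetical (it is not part of lhotse), and the backend name `"LibsndfileBackend"` is an assumption that should be checked against the installed lhotse version. The exact moment lhotse reads the variable is not stated in these notes, so the sketch sets it before lhotse is imported to be safe.

```python
import os
import sys


def configure_audio_backend(name: str) -> bool:
    """Set LHOTSE_AUDIO_BACKEND for this process.

    Returns True if lhotse had not been imported yet, i.e. the variable
    was set before lhotse had any chance to read it.
    """
    os.environ["LHOTSE_AUDIO_BACKEND"] = name
    return "lhotse" not in sys.modules


# "LibsndfileBackend" is an assumed backend name, used for illustration only.
set_early = configure_audio_backend("LibsndfileBackend")
if not set_early:
    print("warning: lhotse was already imported; the variable may be ignored")
```

Exporting the variable in the shell before launching Python has the same effect and avoids the import-order question entirely.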

Full Changelog: v1.20...v1.21

v1.20 - Pining for the Fjords

31 Jan 20:51
455b20e

What's Changed

New features

  • Extended the subset of lhotse that works without installing torchaudio by @pzelasko in #1253 #1255
  • Ensure drop_last=False always returns an equal number of mini-batches by re-distributing and/or duplicating some data by @pzelasko in #1277
  • Improved CPU memory usage and shuffling + bucketing in DynamicBucketingSampler by @pzelasko in #1276
  • Enable seed randomization in dynamic samplers by @pzelasko in #1278
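
The drop_last=False item above is about keeping every distributed rank on the same number of mini-batches. The sketch below illustrates only the duplication half of that idea in plain Python; `equalize_batches` is a hypothetical helper, not lhotse's implementation, which may also re-distribute data between ranks.

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def equalize_batches(per_rank: Sequence[List[T]]) -> List[List[T]]:
    """Pad shorter ranks by repeating their own batches in order,
    so that every rank ends up with the same number of steps."""
    target = max(len(batches) for batches in per_rank)
    balanced = []
    for batches in per_rank:
        if not batches:
            raise ValueError("cannot pad a rank that has no batches")
        padded = list(batches)
        i = 0
        while len(padded) < target:
            padded.append(batches[i % len(batches)])  # duplicate an earlier batch
            i += 1
        balanced.append(padded)
    return balanced


# Rank 0 has two batches, rank 1 has one; rank 1 repeats its only batch.
ranks = [[["a", "b"], ["c"]], [["d", "e"]]]
balanced = equalize_batches(ranks)
```

Equal step counts matter because collective operations in distributed training can hang when one rank finishes its epoch earlier than the others.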

Recipes

Other improvements

  • Update docs with env vars used by Lhotse by @pzelasko in #1252
  • support whisper large v3; deepspeed launcher rank world_size setting by @yuekaizhang in #1260
  • Fix non-deterministic tests by @pzelasko in #1261
  • Fix duplication issues in CutSet.mix() by @pzelasko in #1268
  • Support controllable CutSet.mux weights in multiprocess dataloading by @pzelasko in #1266
  • Fix distributed sampler initialization and exceeded sampler warning false positives by @pzelasko in #1270
  • Install kaldi-native-io explicitly in the kaldi doc example. by @csukuangfj in #1275
  • Allow duplicate cut IDs in a CutSet (CutSet is list-like instead of dict-like) by @pzelasko in #1279

New Contributors

Full Changelog: v1.19...v1.20

v1.19 - The Iceberger

02 Jan 14:58
3e53b68

What's Changed

Features

Recipes

Other improvements

Full Changelog: v1.18...v1.19

v1.18 - The Ice Age

11 Dec 14:10
78b3a12

What's Changed

New features

  • MMS forced alignment backend by @flyingleafe in #1185
  • Two new options: CutSet.from_shar(seed="trng") and DynamicCutSampler(quadratic_duration=...) by @pzelasko in #1199
  • Faster initialization option in DynamicBucketingSampler + various fixes by @pzelasko in #1210
  • CLI to estimate and print bucket bins for a cut set by @pzelasko in #1214
  • More flexible setting of audio backends by @pzelasko in #1219
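
To give a feel for what "estimating bucket bins" can mean, here is one plausible approach in plain Python: choose duration boundaries so that each bucket holds roughly the same total amount of audio. `estimate_duration_bins` is a hypothetical function written for this illustration; the algorithm used by the actual CLI is not described in these notes and may differ.

```python
from typing import Iterable, List


def estimate_duration_bins(durations: Iterable[float], num_buckets: int) -> List[float]:
    """Return up to num_buckets - 1 boundaries that split the sorted durations
    into buckets of roughly equal total duration."""
    if num_buckets < 2:
        raise ValueError("need at least two buckets")
    ordered = sorted(durations)
    per_bucket = sum(ordered) / num_buckets
    bins: List[float] = []
    accumulated = 0.0
    for d in ordered:
        # Start a new bucket once the current one holds more than its share.
        if accumulated > per_bucket and len(bins) < num_buckets - 1:
            bins.append(d)
            accumulated = 0.0
        accumulated += d
    return bins


bins = estimate_duration_bins([float(x) for x in range(1, 11)], num_buckets=3)
```

Precomputing the boundaries once and passing them to a sampler avoids re-scanning the data at every start-up, which is presumably the motivation behind the faster initialization option listed above.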

Recipes

Other improvements

New Contributors

Full Changelog: v1.17...v1.18

v1.17 - Swirling Ice Pick

08 Oct 23:31
9c80a1e

What's Changed

New supported datasets

New features

Enhancements and fixes

New Contributors

Full Changelog: v1.16...v1.17

v1.16 - Mountain Warming

11 Aug 19:36
aa073f6

What's Changed

Recipes

New:

Improvements:

New features

  • CutSet.from_files constructor for random order multi-file cutsets by @pzelasko in #1085
  • Infinite random-file random-line stateless sampler by @pzelasko in #1102
  • StatelessSampler: remove TRNG, make base_seed a mandatory parameter by @pzelasko in #1109
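
The stateless sampler items above can be pictured with a small sketch: an infinite generator that repeatedly picks a random file and then a random line from it, with all randomness derived from a mandatory `base_seed`. Everything here is illustrative; `stateless_sampler` is a hypothetical function, and in-memory lists stand in for manifest files.

```python
import random
from itertools import islice
from typing import Dict, Iterator, List, Tuple


def stateless_sampler(
    files: Dict[str, List[str]], base_seed: int
) -> Iterator[Tuple[str, str]]:
    """Yield (file name, line) pairs forever; there is no epoch or position to track."""
    rng = random.Random(base_seed)
    names = sorted(files)  # fixed order, so the seed alone determines the sequence
    while True:
        name = rng.choice(names)
        yield name, rng.choice(files[name])


shards = {"a.jsonl": ["a0", "a1"], "b.jsonl": ["b0", "b1", "b2"]}
first = list(islice(stateless_sampler(shards, base_seed=0), 5))
again = list(islice(stateless_sampler(shards, base_seed=0), 5))
```

With a fixed seed the sequence is reproducible, which is what an explicit `base_seed` buys over a true random number generator. In a multi-worker setup one would likely mix the rank or worker index into the seed so that workers do not emit identical data; that detail is an assumption here.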

Other improvements

New Contributors

Full Changelog: v1.15...v1.16

v1.15 - Magmatic Fractionation

27 May 00:20
3071ade

What's Changed

New Contributors

Full Changelog: v1.14...v1.15

v1.14 - Curiously Delicious Snowflakes

27 Apr 01:00

What's Changed

New features

  • Add CLIs for creating Lhotse Shar directories and computing features by @pzelasko in #1042
  • Integrate torchaudio's 2.0 ffmpeg backend for audio loading + add some optimizations by @pzelasko in #1043
    • (note: with PyTorch 2.0 set the following env var: TORCHAUDIO_USE_BACKEND_DISPATCHER=1)
  • Loudness normalization with pyloudnorm by @desh2608 in #1016
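
The note about TORCHAUDIO_USE_BACKEND_DISPATCHER=1 implies the variable must be in the environment of the process that uses torchaudio. One way to arrange that from a launcher script is sketched below; the child command is a stand-in that only echoes the variable, where a real launcher would start the training script instead.

```python
import os
import subprocess
import sys

# Copy the current environment and add the variable for the child process only.
env = dict(os.environ, TORCHAUDIO_USE_BACKEND_DISPATCHER="1")

# Stand-in for a real training command: print what the child process sees.
child_code = "import os; print(os.environ['TORCHAUDIO_USE_BACKEND_DISPATCHER'])"
result = subprocess.run(
    [sys.executable, "-c", child_code],
    env=env,
    capture_output=True,
    text=True,
    check=True,
)
seen = result.stdout.strip()
```

Passing `env=` keeps the parent process unchanged, so the setting applies only to the job being launched.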

Recipes

New

Improvements

Other enhancements

New Contributors

Full Changelog: v1.13...v1.14

v1.13 - Local Freezing

23 Mar 14:26

What's Changed

New tutorials

Recipes

New

Fixes

New features

  • Optional quadratic duration correction for dynamic bucketing sampler by @pzelasko in #950
  • Tentative lhotse --> kaldi manifests conversion for multiple channels by @popcornell in #962
  • Add RecordingChunkIterableDataset by @pzelasko in #985
  • Python 3.11 support by @pzelasko in #866
  • (cut|recording).dereverb_wpe() API + more stable numpy version by @pzelasko in #1000
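
The quadratic duration correction listed above can be sketched as follows. The idea is to penalize long utterances when filling a batch, since some models have cost that grows faster than linearly with length. The formula used here, `duration + duration**2 / quadratic_duration`, is an assumed form of the correction, and both functions are hypothetical helpers rather than lhotse code.

```python
from typing import List, Optional


def effective_duration(duration: float, quadratic_duration: Optional[float] = None) -> float:
    """Duration plus an assumed quadratic penalty; unchanged when no correction is set."""
    if quadratic_duration is None:
        return duration
    return duration + duration**2 / quadratic_duration


def batch_by_duration(
    durations: List[float],
    max_duration: float,
    quadratic_duration: Optional[float] = None,
) -> List[List[float]]:
    """Greedily fill batches until the summed effective duration would exceed max_duration."""
    batches: List[List[float]] = []
    current: List[float] = []
    total = 0.0
    for d in durations:
        cost = effective_duration(d, quadratic_duration)
        if current and total + cost > max_duration:
            batches.append(current)
            current, total = [], 0.0
        current.append(d)
        total += cost
    if current:
        batches.append(current)
    return batches


durations = [30.0, 30.0, 5.0, 5.0]
plain = batch_by_duration(durations, max_duration=60.0)
corrected = batch_by_duration(durations, max_duration=60.0, quadratic_duration=30.0)
```

In this example the two 30-second items share a batch without the correction, while with `quadratic_duration=30.0` each counts as 60 seconds and gets a batch of its own; the short items are barely affected.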

General improvements

New Contributors

Full Changelog: v1.12...v1.13

v1.12 - Spicy Yak

17 Jan 00:41
c33345d

What's Changed

New Contributors

Full Changelog: v1.11...v1.12