v1.9 Neighboring Peaks
Major features
- MultiCut data type: simplifies working with multi-channel data (contribution from @desh2608)
- CSJ recipe (contribution from @teowenshen)
- lots of bug fixes
What's Changed
- create proper wav_id in the segments file for multichannel recording by @jtrmal in #831
- kaldi: add a switch/option to read the durations from kaldi utt2dur … by @jtrmal in #832
- Update test packages by @pzelasko in #837
- MultiCut to store multi-channel recordings with shared supervision by @desh2608 in #822
- Use CutSet for whisper annotation workflow by @desh2608 in #834
- use spawn() as the strategy to prevent heisenbug by @jtrmal in #841
- Compatibility for reading alignments saved before Lhotse v1.8 by @pzelasko in #842
- make regexp string raw by @jtrmal in #836
- Use absolute recording paths in yesno recipe by @pzelasko in #845
- Fix CutSet.compute_and_store_features support for lazy CutSets by @pzelasko in #844
- Fixing some QA functions for lazy manifests by @desh2608 in #848
- Fix timestamps in Whisper annotation workflow by @pzelasko in #847
- Update supervisions channels in multi-channel recipes by @desh2608 in #838
- Allow retaining or trimming channels in trim_to_supervisions by @desh2608 in #852
- Match cut_id to utt_id if there is exactly one supervision per cut by @wgb14 in #853
- forced alignment: use num2words to get word timestamps for numbers by @eschmidbauer in #849
- Prepare CSJ by @teowenshen in #851
- Small changes in trim_to_supervisions() by @desh2608 in #855
- Fix checkpoints of samplers that were iterated over more than once within the same epoch by @pzelasko in #854
- Update fisher_english.py by @maxlvov in #858
New Contributors
- @eschmidbauer made their first contribution in #849
- @teowenshen made their first contribution in #851
- @maxlvov made their first contribution in #858
Full Changelog: v1.8...v1.9