v1.8 Sudden Avalanche
Breaking changes
- Python 3.6 is no longer supported as of Lhotse v1.8. If you need to use Python 3.6, please revert to Lhotse 1.7 and earlier.
Highlights
- New experimental module of lhotse:
workflows
, now integrates optional third party packages that assist corpus creators in automated data curation. With release 1.8, we support OpenAI Whisper for automatic transcription and segmentation, and torchaudio Wav2Vec2/Hubert ASR bundles for forced alignment.
What's Changed
- Fix read and write in piped CLI by @desh2608 in #807
- Default behavior of CutSet.mix by @ZuoyunZheng in #809
- Adding more info about resampling options by @RuABraun in #815
- Add
pad_silence
option toextend_by
by @desh2608 in #816 - Message when calling len() on LazyFilter by @desh2608 in #817
- Refactor cut and retain
git blame
history by @desh2608 in #820 - Audio backend refactoring and a workaround for FLAC reading from/writing to in-memory buffers by @pzelasko in #814
- Experimental Lhotse feature: corpus creation tools (
workflows
), starting with OpenAI Whisper support by @pzelasko in #824 - Drop support for Python 3.6 by @pzelasko in #829
- [workflow] Word-level forced alignment with pretrained models from Torchaudio by @pzelasko in #827
New Contributors
- @ZuoyunZheng made their first contribution in #809
Full Changelog: v1.7...v1.8