v1.14 - Curiously Delicious Snowflakes
What's Changed
New features
- Add CLIs for creating Lhotse Shar directories and computing features by @pzelasko in #1042
- Integrate torchaudio's 2.0 ffmpeg backend for audio loading + add some optimizations by @pzelasko in #1043
- (note: with PyTorch 2.0 set the following env var:
TORCHAUDIO_USE_BACKEND_DISPATCHER=1
)
- (note: with PyTorch 2.0 set the following env var:
- Loudness normalization with
pyloudnorm
by @desh2608 in #1016
Recipes
New
- LibriLight dataset by @yfyeung in #1014
- EDACC recipe by @pzelasko in #1022
- [Recipe] BUT Reverb DB by @desh2608 in #1028
Improvements
- Aishell3 speaker, gender, and tone labels by @zjwang21 in #1027
- Small fix for speechcommands.py by @yfyeung in #1012
- Minor fix in LibriCSS recipe by @desh2608 in #1021
- Add
parts
andnum_jobs
options for tedlium by @desh2608 in #1030
Other enhancements
- Bump version to 1.14.0.dev and fix .dev version suffix handling by @pzelasko in #1010
- Update recording id in the supervision after applying WPE by @desh2608 in #1015
- Specify what formats are expected when using the pipe: prefix to load cuts by @srdecny in #1013
- Function to drop alignments from cut by @desh2608 in #1019
- Fix bug in lazy CutSet subset with last by @desh2608 in #1023
- Fix LoudnessNormalization by @lifeiteng in #1029
- API to enable/disable ffmpeg-torchaudio by @desh2608 in #1032
- Ensure RIR has same sampling rate as audio by @desh2608 in #1037
- Add
transforms
attribute for MixedCut by @desh2608 in #1035 - Fix #1038 and #1039 by @pzelasko in #1040
New Contributors
Full Changelog: v1.13...v1.14