v1.14 - Curiously Delicious Snowflakes

pzelasko released this 27 Apr 01:00

2eff8bc

What's Changed

New features

Add CLIs for creating Lhotse Shar directories and computing features by @pzelasko in #1042
Integrate torchaudio's 2.0 ffmpeg backend for audio loading + add some optimizations by @pzelasko in #1043
- (note: with PyTorch 2.0 set the following env var: TORCHAUDIO_USE_BACKEND_DISPATCHER=1)
Loudness normalization with pyloudnorm by @desh2608 in #1016

Recipes

New

LibriLight dataset by @yfyeung in #1014
EDACC recipe by @pzelasko in #1022
[Recipe] BUT Reverb DB by @desh2608 in #1028

Improvements

Aishell3 speaker, gender, and tone labels by @zjwang21 in #1027
Small fix for speechcommands.py by @yfyeung in #1012
Minor fix in LibriCSS recipe by @desh2608 in #1021
Add parts and num_jobs options for tedlium by @desh2608 in #1030

Other enhancements

Bump version to 1.14.0.dev and fix .dev version suffix handling by @pzelasko in #1010
Update recording id in the supervision after applying WPE by @desh2608 in #1015
Specify what formats are expected when using the pipe: prefix to load cuts by @srdecny in #1013
Function to drop alignments from cut by @desh2608 in #1019
Fix bug in lazy CutSet subset with last by @desh2608 in #1023
Fix LoudnessNormalization by @lifeiteng in #1029
API to enable/disable ffmpeg-torchaudio by @desh2608 in #1032
Ensure RIR has same sampling rate as audio by @desh2608 in #1037
Add transforms attribute for MixedCut by @desh2608 in #1035
Fix #1038 and #1039 by @pzelasko in #1040

New Contributors

@srdecny made their first contribution in #1013

Full Changelog: v1.13...v1.14

Contributors

lifeiteng, desh2608, and 4 other contributors

Assets 2