v1.17 - Swirling Ice Pick
What's Changed
New supported datasets
- Speech to text translation utilizing 3-way data by @AmirHussein96 in #1099
- "This American Life" dataset recipe by @flyingleafe in #1140
- Add VoxConverse recipe by @flyingleafe in #1142
- Add recipe for ICASSP2024 ICMC-ASR Grand Challenge by @yfyeung in #1172
New features
- Initial support for video by @pzelasko in #1151
copy_data
: copyCutSet
+ its data to a new location by @pzelasko in #1130- Add whisper feature extractor by @yuekaizhang in #1159
- VAD workflow with Silero by @rilshok in #1160
Enhancements and fixes
- Fix feature extraction for lhotse shar CLI by @pzelasko in #1123
- Add m4a to special cases for num samples determination by @pzelasko in #1124
- making the kaldi import more robust by @vesis84 in #1129
- Tutorial materials in main readme page by @pzelasko in #1133
- optimize save_audios() by @vesis84 in #1131
- Fix bugs in
resumable_download
by @flyingleafe in #1135 - Arxiv badge by @desh2608 in #1136
- Fix docs build by @pzelasko in #1137
- Fix failing tests after repairing docs build by @pzelasko in #1138
- Remove deprecated code, make minor cleanups by @pzelasko in #1139
- Enforce deterministic RNG behavior in repeatedly flaky tests by @pzelasko in #1143
- Refactor
audio.py
into smaller modules by @pzelasko in #1144 - Fix broken
save_audio
by @flyingleafe in #1147 - Optimize
cut_into_windows
for long cuts by @flyingleafe in #1150 - Fixes for #1152 #1153 and #1154 by @pzelasko in #1156
- fix bugs in downloading voxpopuli corpus by @DongjiGao in #1165
- Support
export_to_kaldi
on resampled recordings by @sih4sing5hong5 in #1162 - Refactor
CutSet.describe
to enable parallel statistics computation by @pzelasko in #1168 - Allow dashes in feat CLI by @desh2608 in #1169
- Apply deterministic RNG to more unit tests by @pzelasko in #1173
- Add
fix_manifests
in all recipes by @desh2608 in #1128 - Fix small bug in eval2000 by @desh2608 in #1126
- Fix download in LibriCSS recipe by @desh2608 in #1148
New Contributors
- @sih4sing5hong5 made their first contribution in #1162
- @rilshok made their first contribution in #1160
Full Changelog: v1.16...v1.17