v1.13 - Local Freezing
What's Changed
New tutorials
Recipes
New
- CSJ: Faithful Manifest by @teowenshen in #940
- himia dataset by @glynpu in #991
- Speech Commands v0.01 & v0.02 dataset by @yfyeung in #996
- Aishell3 by @zjwang21 in #998
Fixes
- add dataset-parts argument to libritts by @lifeiteng in #956
- Add option to create segments for LibriCSS by @desh2608 in #961
- fix tal_csasr data pre-processing by @KajiMaCN in #975
- Fix wrong Common Voice parsing by @trunglebka in #979
- add the download function of commonvoice command line interface by @manbaaaa in #968
- store LJSpeech normalized text by @lifeiteng in #988
- LJSpeech strip normalized text by @lifeiteng in #992
New features
- Optional quadratic duration correction for dynamic bucketing sampler by @pzelasko in #950
- Tentative lhotse --> kaldi manifests conversion for multiple channels by @popcornell in #962
- Add RecordingChunkIterableDataset by @pzelasko in #985
- Python 3.11 support by @pzelasko in #866
- (cut|recording).dereverb_wpe() API + more stable numpy version by @pzelasko in #1000
General improvements
- Release v1.12, bump dev version to 1.13.0.dev by @pzelasko in #945
- Batch extraction for kaldi features by @desh2608 in #947
- Fix features_lens for rare failure cases by @pzelasko in #953
- add 'decode_options' to annotate_with_whisper by @Joemgu7 in #954
- Preserve custom field when converting MultiCut to MonoCuts by @pzelasko in #957
- Fix libritts dataset-parts by @lifeiteng in #960
- Handling of -1 in the segments file by @JinZr in #952
- Bug in trim_to_supervision_groups + tolerance for overspans by @desh2608 in #963
- Fix for OnTheFlyFeatures with batched inference by @pzelasko in #965
- Allow dash in SequentialJsonlWriter by @desh2608 in #967
- Support move_to_memory for MixedCut and PaddingCut by @pzelasko in #970
- Add new method MixedCut.to_mono() by @pzelasko in #973
- Add multiprocessing to meeting simulation workflow by @desh2608 in #972
- Kaldi-import: floor wav duration to milliseconds by @vesis84 in #971
- Fix bug in computing same speaker pause distribution by @desh2608 in #974
- Add padding direction when using transform ExtraPadding by @marcoyang1998 in #980
- make sure the ProcessPoolExecutor executor uses spawn context by @jtrmal in #982
- Small fix for example in class SupervisionSegment by @yfyeung in #994
- Changing devices in Fbank by @Tomiinek in #999
- Fix for issue #1001 by @yfyeung in #1002
- Fix typo in https://github.com/lhotse-speech/lhotse/blob/master/lhotse/cut/mixed.py line 76 by @yfyeung in #1003
- Add cache for KaldiReader by @david20181 in #1004
- Fix load_kaldi_data_dir not loading segments and feats.scp correctly by @yasumori in #1005
- Fix minor bug in conversational meeting simulation algorithm by @desh2608 in #1007
New Contributors
- @Joemgu7 made their first contribution in #954
- @lifeiteng made their first contribution in #956
- @vesis84 made their first contribution in #971
- @KajiMaCN made their first contribution in #975
- @marcoyang1998 made their first contribution in #980
- @manbaaaa made their first contribution in #968
- @yfyeung made their first contribution in #994
- @zjwang21 made their first contribution in #998
- @david20181 made their first contribution in #1004
- @yasumori made their first contribution in #1005
Full Changelog: v1.12...v1.13