Release v1.13 - Local Freezing · lhotse-speech/lhotse

What's Changed

New tutorials

Lhotse Shar tutorial notebook by @pzelasko in #1006

Recipes

New

CSJ: Faithful Manifest by @teowenshen in #940
himia dataset by @glynpu in #991
Speech Commands v0.01 & v0.02 dataset by @yfyeung in #996
Aishell3 by @zjwang21 in #998

Fixes

add dataset-parts argument to libritts by @lifeiteng in #956
Add option to create segments for LibriCSS by @desh2608 in #961
fix tal_csasr data pre-processing by @KajiMaCN in #975
Fix wrong Common Voice parsing by @trunglebka in #979
add the download function of commonvoice command line interface by @manbaaaa in #968
store LJSpeech normalized text by @lifeiteng in #988
LJSpeech strip normalized text by @lifeiteng in #992

New features

Optional quadratic duration correction for dynamic bucketing sampler by @pzelasko in #950
Tentative lhotse --> kaldi manifests conversion for multiple channels by @popcornell in #962
Add RecordingChunkIterableDataset by @pzelasko in #985
Python 3.11 support by @pzelasko in #866
(cut|recording).dereverb_wpe() API + more stable numpy version by @pzelasko in #1000

General improvements

Release v1.12, bump dev version to 1.13.0.dev by @pzelasko in #945
Batch extraction for kaldi features by @desh2608 in #947
Fix features_lens for rare failure cases by @pzelasko in #953
add 'decode_options' to annotate_with_whisper by @Joemgu7 in #954
Preserve custom field when converting MultiCut to MonoCuts by @pzelasko in #957
Fix libritts dataset-parts by @lifeiteng in #960
Handling of -1 in the segments file by @JinZr in #952
Bug in trim_to_supervision_groups + tolerance for overspans by @desh2608 in #963
Fix for OnTheFlyFeatures with batched inference by @pzelasko in #965
Allow dash in SequentialJsonlWriter by @desh2608 in #967
Support move_to_memory for MixedCut and PaddingCut by @pzelasko in #970
Add new method MixedCut.to_mono() by @pzelasko in #973
Add multiprocessing to meeting simulation workflow by @desh2608 in #972
Kaldi-import: floor wav duration to milliseconds by @vesis84 in #971
Fix bug in computing same speaker pause distribution by @desh2608 in #974
Add padding direction when using transform ExtraPadding by @marcoyang1998 in #980
make sure the ProcessPoolExecutor executor uses spawn context by @jtrmal in #982
Small fix for example in class SupervisionSegment by @yfyeung in #994
Changing devices in Fbank by @Tomiinek in #999
Fix for issue #1001 by @yfyeung in #1002
Fix typo in https://github.com/lhotse-speech/lhotse/blob/master/lhotse/cut/mixed.py line 76 by @yfyeung in #1003
Add cache for KaldiReader by @david20181 in #1004
Fix load_kaldi_data_dir not loading segments and feats.scp correctly by @yasumori in #1005
Fix minor bug in conversational meeting simulation algorithm by @desh2608 in #1007

New Contributors

@Joemgu7 made their first contribution in #954
@lifeiteng made their first contribution in #956
@vesis84 made their first contribution in #971
@KajiMaCN made their first contribution in #975
@marcoyang1998 made their first contribution in #980
@manbaaaa made their first contribution in #968
@yfyeung made their first contribution in #994
@zjwang21 made their first contribution in #998
@david20181 made their first contribution in #1004
@yasumori made their first contribution in #1005

Full Changelog: v1.12...v1.13
