[TTS] Add additional config to preprocess_text and compute_feature_stats #7321
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
What does this PR do ?
Add some configuration to TTS preprocessing to make them more usable.
Most important, adding
batch_size
to text normalization multi-processing which I have found to make it about 10x faster than using the default 'auto' batch size in joblib. I do not fully understand why, but the documentation indicates that large batch sizes work for small, fast tasks:Collection: [TTS]
Changelog
batch_size
parameter topreprocess_text.py
preprocess_text.py
lower_case
forpreprocess_text.py
compute_feature_stats.py
so that it can take a list of manifests, making it easy to compute pitch/energy stats across multiple datasets.Before your PR is "Ready for review"
Pre checks:
PR Type: