[TTS] Add additional config to preprocess_text and compute_feature_stats #7321

rlangman · 2023-08-25T17:00:37Z

What does this PR do ?

Add some configuration to TTS preprocessing to make them more usable.

Most important, adding batch_size to text normalization multi-processing which I have found to make it about 10x faster than using the default 'auto' batch size in joblib. I do not fully understand why, but the documentation indicates that large batch sizes work for small, fast tasks:

batch_size: int or 'auto', default: 'auto'
The number of atomic tasks to dispatch at once to each
worker. When individual evaluations are very fast, dispatching
calls to workers can be slower than sequential computation because
of the overhead. Batching fast computations together can mitigate
this.

Collection: [TTS]

Changelog

Add batch_size parameter to preprocess_text.py
Add input and output field parameters to preprocess_text.py
Fix boolean flag parsing for lower_case for preprocess_text.py
Modify compute_feature_stats.py so that it can take a list of manifests, making it easy to compute pitch/energy stats across multiple datasets.

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you add or update any necessary documentation?
Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
- Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

New Feature
Bugfix
Documentation

scripts/dataset_processing/tts/preprocess_text.py

racoiaws · 2023-08-25T17:13:07Z

Otherwise LGTM

Signed-off-by: Ryan <[email protected]>

…ats (NVIDIA#7321) * [TTS] Add additional config to preprocess_text and compute_feature_stats Signed-off-by: Ryan <[email protected]> * [TTS] Rename batch_size to joblib_batch_size Signed-off-by: Ryan <[email protected]> --------- Signed-off-by: Ryan <[email protected]>

rlangman requested review from XuesongYang, redoctopus, racoiaws and subhankar-ghosh August 25, 2023 17:00

github-actions bot added the TTS label Aug 25, 2023

racoiaws reviewed Aug 25, 2023

View reviewed changes

scripts/dataset_processing/tts/preprocess_text.py Outdated Show resolved Hide resolved

racoiaws approved these changes Aug 28, 2023

View reviewed changes

rlangman force-pushed the tts_preprocess branch from 3d81cb6 to e12222e Compare August 28, 2023 19:01

rlangman added 2 commits August 28, 2023 13:44

[TTS] Add additional config to preprocess_text and compute_feature_stats

a74faad

Signed-off-by: Ryan <[email protected]>

[TTS] Rename batch_size to joblib_batch_size

b33dae6

Signed-off-by: Ryan <[email protected]>

rlangman force-pushed the tts_preprocess branch from e12222e to b33dae6 Compare August 28, 2023 20:44

rlangman merged commit f265ac4 into main Aug 29, 2023
15 checks passed

rlangman deleted the tts_preprocess branch August 29, 2023 00:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[TTS] Add additional config to preprocess_text and compute_feature_stats #7321

[TTS] Add additional config to preprocess_text and compute_feature_stats #7321

rlangman commented Aug 25, 2023

racoiaws commented Aug 25, 2023

[TTS] Add additional config to preprocess_text and compute_feature_stats #7321

[TTS] Add additional config to preprocess_text and compute_feature_stats #7321

Conversation

rlangman commented Aug 25, 2023

What does this PR do ?

Changelog

Before your PR is "Ready for review"

racoiaws commented Aug 25, 2023