LightGrad: Lightweight Diffusion Probabilistic Model for Text-to-speech

Demos are available at: https://thuhcsi.github.io/LightGrad/

Setup Environment

Install python 3.10.

Then, run:

git clone --recursive https://github.com/thuhcsi/LightGrad.git
python -m pip install -r requirements.txt

Training

Preprocess for BZNSYP

Download dataset from url. Run

python preprocess.py bznsyp [PATH_TO_DIRECTORY_CONTAINING_DATASET] \
    [PATH_TO_DIRECTORY_FOR_SAVING_PREPROCESS_RESULTS] \
    --test_sample_count 200 --valid_sample_count 200

This will produce phn2id.json, train_dataset.json, test_dataset.json, valid_dataset.json in [PATH_TO_DIRECTORY_FOR_SAVING_PREPROCESS_RESULTS].

Preprocess for LJSpeech

Download dataset from url. Run

python preprocess.py ljspeech [PATH_TO_DIRECTORY_CONTAINING_DATASET] \
    [PATH_TO_DIRECTORY_FOR_SAVING_PREPROCESS_RESULTS] \
    --test_sample_count 200 --valid_sample_count 200

This will produce phn2id.json, train_dataset.json, test_dataset.json, valid_dataset.json in [PATH_TO_DIRECTORY_FOR_SAVING_PREPROCESS_RESULTS].

Training for BZNSYP

Edit config/bznsyp_config.yaml, set train_datalist_path, valid_datalist_path, phn2id_path and log_dir. Run:

python train.py -c config/bznsyp_config.yaml

Training for LJSpeech

Edit config/ljspeech_config.yaml, set train_datalist_path, valid_datalist_path, phn2id_path and log_dir. Run:

python train.py -c config/ljspeech_config.yaml

Inference

Edit inference.ipynb. Set HiFiGAN_CONFIG, HiFiGAN_ckpt and ckpt_path to corresponding files, respectively.

Note: add_blank in inference.ipynb should be the same as that in LightGrad/dataset.py.

References

Our model is based on Grad-TTS.
HiFi-GAN is used as vocoder.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
LightGrad		LightGrad
config		config
dataset		dataset
hifi_gan @ 4769534		hifi_gan @ 4769534
text		text
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
inference.ipynb		inference.ipynb
preprocess.py		preprocess.py
requirements.txt		requirements.txt
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LightGrad: Lightweight Diffusion Probabilistic Model for Text-to-speech

Setup Environment

Training

Preprocess for BZNSYP

Preprocess for LJSpeech

Training for BZNSYP

Training for LJSpeech

Inference

References

About

Releases

Packages

Languages

License

thuhcsi/LightGrad

Folders and files

Latest commit

History

Repository files navigation

LightGrad: Lightweight Diffusion Probabilistic Model for Text-to-speech

Setup Environment

Training

Preprocess for BZNSYP

Preprocess for LJSpeech

Training for BZNSYP

Training for LJSpeech

Inference

References

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages