TTS FastPitch Finetuning #2684

paarthneekhara · 2021-08-17T21:10:28Z

No description provided.

examples/tts/conf/fastpitch_align_44100.yaml

examples/tts/conf/fastpitch_align_finetuning.yaml

nemo/collections/asr/data/audio_to_text.py

nemo/collections/tts/models/fastpitch.py

tutorials/tts/4_TTS_FastPitch_Finetuning.ipynb

blisc · 2021-08-18T20:53:25Z

tutorials/tts/4_TTS_FastPitch_Finetuning.ipynb

+    "from nemo.collections.tts.models import HifiGanModel\n",
+    "from nemo.collections.tts.models import FastPitchModel\n",
+    "\n",
+    "hifigan_ckpt_path = \"/home/pneekhara/PreTrainedModels/HifiGan--val_loss=0.08-epoch=899.ckpt\"\n",


We probably also need to publish this model to NGC

Yes, will need to do that.

Let's add a placeholder for the FastPitch checkpoint as well, and add a comment that we plan on releasing .nemo files for this notebook soon.

tutorials/tts/4_TTS_FastPitch_Finetuning.ipynb

blisc · 2021-08-19T14:03:57Z

tutorials/tts/4_TTS_FastPitch_Finetuning.ipynb

+    "    plt.show()"
+   ]
+  },
+  {


I think it would also be cool to show how people can fine tune their model on other data. Maybe even use their own voice after recording a few samples.

Can move this to another PR

examples/tts/fastpitch2_finetune.py

Signed-off-by: Paarth Neekhara <[email protected]>

* Added filelists and finetuning code * image logging transoposed bug fix in FastPitch * pytorch lightning version requirement * added filelists for multispeaker hifigan * hifigan 44100 Hz * updated filelists, added another configuration of hifigan generator as per jason * added notebooks and some scripts * synthesize samples script update * removed sample filelists * reverted hifigan changes * reverting hifigan.yaml update Signed-off-by: Paarth Neekhara <[email protected]>

Signed-off-by: Paarth Neekhara <[email protected]>

…or fastpitch dataset, comments in configuration files Signed-off-by: Paarth Neekhara <[email protected]>

Signed-off-by: Paarth Neekhara <[email protected]>

…le finetuning Signed-off-by: Paarth Neekhara <[email protected]>

Signed-off-by: Paarth Neekhara <[email protected]>

* Added speaker in FastPitch2 dataloader Signed-off-by: Paarth Neekhara <[email protected]> * changed fastpitch_align.yaml to have configuration for 441000 Hz Signed-off-by: Paarth Neekhara <[email protected]> * trying to fix g2p Signed-off-by: Paarth Neekhara <[email protected]> * pytorch lightning version requirement Signed-off-by: Paarth Neekhara <[email protected]> * Cleanupfinetuning (NVIDIA#2) * Added filelists and finetuning code * image logging transoposed bug fix in FastPitch * pytorch lightning version requirement * added filelists for multispeaker hifigan * hifigan 44100 Hz * updated filelists, added another configuration of hifigan generator as per jason * added notebooks and some scripts * synthesize samples script update * removed sample filelists * reverted hifigan changes * reverting hifigan.yaml update Signed-off-by: Paarth Neekhara <[email protected]> * added finetuning notebook Signed-off-by: Paarth Neekhara <[email protected]> * removed fastpitch2.py (redundant with fastpitch.py) Signed-off-by: Paarth Neekhara <[email protected]> * restored old fastpitch_align.yaml, made just one finetuning yaml Signed-off-by: Paarth Neekhara <[email protected]> * reverting to old fastpitch align yaml Signed-off-by: Paarth Neekhara <[email protected]> * reverted to old vocabs.py Signed-off-by: Paarth Neekhara <[email protected]> * reverted to old vocabs.py Signed-off-by: Paarth Neekhara <[email protected]> * reverting requirements change Signed-off-by: Paarth Neekhara <[email protected]> * addressed pull request reviews -- updated notebook, speaker loading for fastpitch dataset, comments in configuration files Signed-off-by: Paarth Neekhara <[email protected]> * notebook update Signed-off-by: Paarth Neekhara <[email protected]> * removed redundant configuration file, updated notebook Signed-off-by: Paarth Neekhara <[email protected]> * some more corrections for switching to single configuration file Signed-off-by: Paarth Neekhara <[email protected]> * added url for hifigan dataset in notebook Signed-off-by: Paarth Neekhara <[email protected]> * warning messages if optimizer configuration does not look correct while finetuning Signed-off-by: Paarth Neekhara <[email protected]> * style error fix Signed-off-by: Paarth Neekhara <[email protected]> * dataloader fix after master merge Signed-off-by: Paarth Neekhara <[email protected]> Co-authored-by: Jason <[email protected]>

blisc requested changes Aug 18, 2021

View reviewed changes

blisc reviewed Aug 19, 2021

View reviewed changes

tutorials/tts/4_TTS_FastPitch_Finetuning.ipynb Outdated Show resolved Hide resolved

blisc reviewed Aug 19, 2021

View reviewed changes

blisc mentioned this pull request Aug 19, 2021

Minor FastPitch Fixes #2697

Merged

blisc self-assigned this Aug 19, 2021

blisc reviewed Aug 20, 2021

View reviewed changes

examples/tts/fastpitch2_finetune.py Show resolved Hide resolved

paarthneekhara added 19 commits August 20, 2021 10:54

Added speaker in FastPitch2 dataloader

1383553

Signed-off-by: Paarth Neekhara <[email protected]>

changed fastpitch_align.yaml to have configuration for 441000 Hz

2d68e76

Signed-off-by: Paarth Neekhara <[email protected]>

trying to fix g2p

fad6d3a

Signed-off-by: Paarth Neekhara <[email protected]>

pytorch lightning version requirement

abff093

Signed-off-by: Paarth Neekhara <[email protected]>

added finetuning notebook

f91a9e0

Signed-off-by: Paarth Neekhara <[email protected]>

removed fastpitch2.py (redundant with fastpitch.py)

93c4379

Signed-off-by: Paarth Neekhara <[email protected]>

restored old fastpitch_align.yaml, made just one finetuning yaml

715ad0c

Signed-off-by: Paarth Neekhara <[email protected]>

reverting to old fastpitch align yaml

bd06780

Signed-off-by: Paarth Neekhara <[email protected]>

reverted to old vocabs.py

b97b095

Signed-off-by: Paarth Neekhara <[email protected]>

reverted to old vocabs.py

a751785

Signed-off-by: Paarth Neekhara <[email protected]>

reverting requirements change

1b5291f

Signed-off-by: Paarth Neekhara <[email protected]>

addressed pull request reviews -- updated notebook, speaker loading f…

68cb863

…or fastpitch dataset, comments in configuration files Signed-off-by: Paarth Neekhara <[email protected]>

notebook update

6d53329

Signed-off-by: Paarth Neekhara <[email protected]>

removed redundant configuration file, updated notebook

d917be3

Signed-off-by: Paarth Neekhara <[email protected]>

some more corrections for switching to single configuration file

322db0c

Signed-off-by: Paarth Neekhara <[email protected]>

added url for hifigan dataset in notebook

bf78860

Signed-off-by: Paarth Neekhara <[email protected]>

warning messages if optimizer configuration does not look correct whi…

8a6f8b0

…le finetuning Signed-off-by: Paarth Neekhara <[email protected]>

style error fix

e9dfa32

Signed-off-by: Paarth Neekhara <[email protected]>

paarthneekhara force-pushed the main branch from 8ae0820 to e9dfa32 Compare August 20, 2021 17:54

blisc approved these changes Aug 20, 2021

View reviewed changes

paarthneekhara added 2 commits August 24, 2021 16:44

Merge branch 'main' into main

c4af0c9

dataloader fix after master merge

92b8aca

Signed-off-by: Paarth Neekhara <[email protected]>

paarthneekhara force-pushed the main branch from 387b3b2 to 92b8aca Compare August 25, 2021 16:23

Merge branch 'main' into main

49761df

blisc merged commit 0d7de7c into NVIDIA:main Aug 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TTS FastPitch Finetuning #2684

TTS FastPitch Finetuning #2684

paarthneekhara commented Aug 17, 2021

blisc Aug 18, 2021

paarthneekhara Aug 18, 2021

blisc Aug 19, 2021

blisc Aug 19, 2021

blisc Aug 19, 2021

TTS FastPitch Finetuning #2684

TTS FastPitch Finetuning #2684

Conversation

paarthneekhara commented Aug 17, 2021

blisc Aug 18, 2021

Choose a reason for hiding this comment

paarthneekhara Aug 18, 2021

Choose a reason for hiding this comment

blisc Aug 19, 2021

Choose a reason for hiding this comment

blisc Aug 19, 2021

Choose a reason for hiding this comment

blisc Aug 19, 2021

Choose a reason for hiding this comment