Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Prompt Learning Notebook Bug Fix #4689

Merged
merged 5 commits into from
Aug 10, 2022
Merged

Conversation

vadam5
Copy link
Contributor

@vadam5 vadam5 commented Aug 5, 2022

Prompt Learning Notebook Bug Fix

Signed-off-by: Virginia Adams [email protected]

Adds back ability to handle dataset example dicts directly passed into the .generate method for prompt learning

Collection: NLP

Changelog

  • Updated GPTPromptLearningDataset class
  • Updated GPTPromptLearningModel class's generate function
  • Updated prompt learning unit test

Usage

test_examples = [
    {"taskname": "intent_and_slot", "utterance": "tell me who will win the next presidential election"},
    {"taskname": "intent_and_slot", "utterance": "i would like to pickup a veggie sub with a cookie from subway"},
    {"taskname": "intent_and_slot", "utterance": "email happy new year to john"},
    {"taskname": "intent_and_slot", "utterance": "set the alarm to seven am for work"},
    {"taskname": "sentiment", "sentence": "The products have a low salt and fat content ."},
    {"taskname": "sentiment", "sentence": "The agreement is valid for four years ."},
    {"taskname": "sentiment", "sentence": "Diluted EPS rose to EUR3 .68 from EUR0 .50 ."},
    {"taskname": "sentiment", "sentence": "The company is well positioned in Brazil and Uruguay ."},
    {"taskname": "sentiment", "sentence": "Profit before taxes decreased by 9 % to EUR 187.8 mn in the first nine months of 2008 , compared to EUR 207.1 mn a year earlier ."},
]

response = model.generate(inputs=test_examples, length_params=None)

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you add or update any necessary documentation?
  • Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
    • Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

  • New Feature
  • Bugfix
  • Documentation

If you haven't finished some of the above items you can still open "Draft" PR.

Who can review?

Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.

Additional Information

  • Related to # (issue)

@vadam5 vadam5 requested a review from yidong72 August 5, 2022 21:54
@vadam5 vadam5 changed the title Added back dataset class list of dict input for generation in tutoria… Prompt Learning Notebook Bug Fix Aug 5, 2022
@okuchaiev okuchaiev merged commit 42f7626 into r1.11.0 Aug 10, 2022
@vadam5 vadam5 deleted the prompt-learning-notebook-fix branch August 10, 2022 23:52
ericharper pushed a commit that referenced this pull request Aug 22, 2022
* Added back dataset class list of dict input for generation in tutorial notebook

Signed-off-by: Virginia Adams <[email protected]>

* updated argument name for build dataset

Signed-off-by: Virginia Adams <[email protected]>

Signed-off-by: Virginia Adams <[email protected]>
ericharper added a commit that referenced this pull request Aug 24, 2022
* NeMo Megatron doc updates

Signed-off-by: Oleksii Kuchaiev <[email protected]>

* update branch

Signed-off-by: ericharper <[email protected]>

* update package info and dockerfile

Signed-off-by: ericharper <[email protected]>

* fix fastpitch export (#4676)

Signed-off-by: Jason <[email protected]>

* [TTS] fixed wrong pronunciations for r1.11. (#4677)

* [TTS] fixed wrong pronunciations.

Signed-off-by: Xuesong Yang <[email protected]>

* incremented the version number to 22.08 as @blisc suggested.

Signed-off-by: Xuesong Yang <[email protected]>

* correct cmudict versions in world-wide places.

Signed-off-by: Xuesong Yang <[email protected]>

* Fix for incorrect batch size issue while decoding (#4675)

Co-authored-by: Micha Livne <[email protected]>
Co-authored-by: Eric Harper <[email protected]>

* [TTS] incremented the version number to 22.08 in tutorials. (#4684)

* [TTS] incremented the version number to 22.08 in tutorials.

Signed-off-by: Xuesong Yang <[email protected]>

* Megatron encode function with RPE fix (#4692)

* Fix for RPE

Signed-off-by: MaximumEntropy <[email protected]>

* Style

Signed-off-by: MaximumEntropy <[email protected]>

* fix to fetch config file (#4699)

Signed-off-by: nithinraok <[email protected]>

* Fix notebook for buffered inference (#4703)

Signed-off-by: smajumdar <[email protected]>

* Prompt Learning Notebook Bug Fix (#4689)

* Added back dataset class list of dict input for generation in tutorial notebook

Signed-off-by: Virginia Adams <[email protected]>

* updated argument name for build dataset

Signed-off-by: Virginia Adams <[email protected]>

Signed-off-by: Virginia Adams <[email protected]>

* add psutils to mock imports (#4728)

Signed-off-by: ericharper <[email protected]>

Signed-off-by: ericharper <[email protected]>

* Update Aligner model and tutorial to add NGC checkpoint loading (#4714)

* Update Aligner model and tutorial to add NGC checkpoint loading

Signed-off-by: Jocelyn Huang <[email protected]>

* Fix pynini install for Aligner notebook, minor formatting fix for model

Signed-off-by: Jocelyn Huang <[email protected]>

* Aligner notebook formatting consistency

Signed-off-by: Jocelyn Huang <[email protected]>

Signed-off-by: Jocelyn Huang <[email protected]>
Co-authored-by: Xuesong Yang <[email protected]>

* [TTS] bugfix for missing configs. (#4725)

Signed-off-by: Xuesong Yang <[email protected]>

* docs typo fix

Signed-off-by: Oleksii Kuchaiev <[email protected]>

* Fix pynini install in TTS tutorials (#4729)

Signed-off-by: Jocelyn Huang <[email protected]>

Signed-off-by: Jocelyn Huang <[email protected]>

* Fix ASR notebooks (#4738)

Signed-off-by: smajumdar <[email protected]>

Signed-off-by: smajumdar <[email protected]>

* Multilingual VAD model (#4734)

* add ngc link

Signed-off-by: fayejf <[email protected]>

* add tuned VAD config on ASR data

Signed-off-by: fayejf <[email protected]>

* yaml note

Signed-off-by: fayejf <[email protected]>

* update vad asr notebook with mVAD

Signed-off-by: fayejf <[email protected]>

* update vad infer config comment

Signed-off-by: fayejf <[email protected]>

* fix

Signed-off-by: fayejf <[email protected]>

* mvad sd config for ch109

Signed-off-by: fayejf <[email protected]>

* update sd readme

Signed-off-by: fayejf <[email protected]>

* add new mVAD model to doc

Signed-off-by: fayejf <[email protected]>

* style fix

Signed-off-by: fayejf <[email protected]>

* update sd tutorial with mVAD

Signed-off-by: fayejf <[email protected]>

* typo fix

Signed-off-by: fayejf <[email protected]>

Signed-off-by: fayejf <[email protected]>

* publish pretrained itn t5 model for English (#4748)

Signed-off-by: Alexandra Antonova <[email protected]>

Signed-off-by: Alexandra Antonova <[email protected]>
Co-authored-by: Alexandra Antonova <[email protected]>

* Updated docs and doc paths (#4754)

* Updated docs and doc paths

Signed-off-by: Virginia Adams <[email protected]>

* Update Multitask_Prompt_and_PTuning.ipynb

* Update README.rst

* Changed branch name to use single quotes

Signed-off-by: Virginia Adams <[email protected]>

Signed-off-by: Virginia Adams <[email protected]>

* fix bug relating to ddp strategy in joint intent slot classification tutorial (#4762)

* [TTS] updated config with a German IPA phoneme tokenizer (#4756)

* [TTS] added a German IPA phoneme tokenizer
* [TTS][ASR] enabled customized arguments for trimming the leading and trailing silence.
* [TTS] disabled spline interpolation for beta-binomial distribution. Let it generate align prior and save to disks. Use a new phoneme tokenizer.
* [TTS] use consistent spline interpolation with fastpitch checkpoint when generating mel-spectrograms for hifigan finetune.

Signed-off-by: Xuesong Yang <[email protected]>

* Update r1.11 to new heteronyms list (#4745)

* Update configs to new heteronyms list
* Remove old heteronyms list, add alt 'merchandise' pron to CMUdict
* Update remaining references to old heteronyms list

Signed-off-by: Jocelyn Huang <[email protected]>
Co-authored-by: Xuesong Yang <[email protected]>

* [TTS] Add multi-speaker German FastPitch and HiFiGAN NGC checkpoints (#4763)

Signed-off-by: Xuesong Yang <[email protected]>

Signed-off-by: Xuesong Yang <[email protected]>

* [TTS] Add single male speaker German FastPitch and HiFiGAN NGC checkpoints (#4770)

Signed-off-by: Xuesong Yang <[email protected]>

* Update CMUdict with more recent 0.7b entries (#4768)

Signed-off-by: Jocelyn Huang <[email protected]>

Signed-off-by: Jocelyn Huang <[email protected]>
Co-authored-by: Xuesong Yang <[email protected]>

* Install pynini in docker container (#4733)

Signed-off-by: Vladimir Bataev <[email protected]>

Signed-off-by: Vladimir Bataev <[email protected]>
Co-authored-by: Nithin Rao <[email protected]>
Co-authored-by: Xuesong Yang <[email protected]>
Co-authored-by: Eric Harper <[email protected]>

* Fix tutorial formatting (#4778)

Signed-off-by: Jocelyn Huang <[email protected]>

* [TTS] deprecated old scripts for ljspeech. (#4780)

* deprecated old scripts for ljspeech.
* removed relevent function calls in TTS docs.

Signed-off-by: Xuesong Yang <[email protected]>

* update branch

Signed-off-by: ericharper <[email protected]>

* update package info and requirements

Signed-off-by: ericharper <[email protected]>

* update container

Signed-off-by: ericharper <[email protected]>

* Update stragglers to new cmudict and heteronyms paths

Signed-off-by: Jocelyn Huang <[email protected]>

Signed-off-by: Oleksii Kuchaiev <[email protected]>
Signed-off-by: ericharper <[email protected]>
Signed-off-by: Jason <[email protected]>
Signed-off-by: Xuesong Yang <[email protected]>
Signed-off-by: MaximumEntropy <[email protected]>
Signed-off-by: nithinraok <[email protected]>
Signed-off-by: smajumdar <[email protected]>
Signed-off-by: Virginia Adams <[email protected]>
Signed-off-by: Jocelyn Huang <[email protected]>
Signed-off-by: fayejf <[email protected]>
Signed-off-by: Alexandra Antonova <[email protected]>
Signed-off-by: Vladimir Bataev <[email protected]>
Co-authored-by: Oleksii Kuchaiev <[email protected]>
Co-authored-by: Jason <[email protected]>
Co-authored-by: Xuesong Yang <[email protected]>
Co-authored-by: Rajesh Ilango <[email protected]>
Co-authored-by: Micha Livne <[email protected]>
Co-authored-by: Sandeep Subramanian <[email protected]>
Co-authored-by: Nithin Rao <[email protected]>
Co-authored-by: Somshubra Majumdar <[email protected]>
Co-authored-by: Virginia Adams <[email protected]>
Co-authored-by: Jocelyn <[email protected]>
Co-authored-by: fayejf <[email protected]>
Co-authored-by: bene-ges <[email protected]>
Co-authored-by: Alexandra Antonova <[email protected]>
Co-authored-by: Zhilin Wang <[email protected]>
Co-authored-by: Vladimir Bataev <[email protected]>
piraka9011 pushed a commit to piraka9011/NeMo that referenced this pull request Aug 25, 2022
* NeMo Megatron doc updates

Signed-off-by: Oleksii Kuchaiev <[email protected]>

* update branch

Signed-off-by: ericharper <[email protected]>

* update package info and dockerfile

Signed-off-by: ericharper <[email protected]>

* fix fastpitch export (NVIDIA#4676)

Signed-off-by: Jason <[email protected]>

* [TTS] fixed wrong pronunciations for r1.11. (NVIDIA#4677)

* [TTS] fixed wrong pronunciations.

Signed-off-by: Xuesong Yang <[email protected]>

* incremented the version number to 22.08 as @blisc suggested.

Signed-off-by: Xuesong Yang <[email protected]>

* correct cmudict versions in world-wide places.

Signed-off-by: Xuesong Yang <[email protected]>

* Fix for incorrect batch size issue while decoding (NVIDIA#4675)

Co-authored-by: Micha Livne <[email protected]>
Co-authored-by: Eric Harper <[email protected]>

* [TTS] incremented the version number to 22.08 in tutorials. (NVIDIA#4684)

* [TTS] incremented the version number to 22.08 in tutorials.

Signed-off-by: Xuesong Yang <[email protected]>

* Megatron encode function with RPE fix (NVIDIA#4692)

* Fix for RPE

Signed-off-by: MaximumEntropy <[email protected]>

* Style

Signed-off-by: MaximumEntropy <[email protected]>

* fix to fetch config file (NVIDIA#4699)

Signed-off-by: nithinraok <[email protected]>

* Fix notebook for buffered inference (NVIDIA#4703)

Signed-off-by: smajumdar <[email protected]>

* Prompt Learning Notebook Bug Fix (NVIDIA#4689)

* Added back dataset class list of dict input for generation in tutorial notebook

Signed-off-by: Virginia Adams <[email protected]>

* updated argument name for build dataset

Signed-off-by: Virginia Adams <[email protected]>

Signed-off-by: Virginia Adams <[email protected]>

* add psutils to mock imports (NVIDIA#4728)

Signed-off-by: ericharper <[email protected]>

Signed-off-by: ericharper <[email protected]>

* Update Aligner model and tutorial to add NGC checkpoint loading (NVIDIA#4714)

* Update Aligner model and tutorial to add NGC checkpoint loading

Signed-off-by: Jocelyn Huang <[email protected]>

* Fix pynini install for Aligner notebook, minor formatting fix for model

Signed-off-by: Jocelyn Huang <[email protected]>

* Aligner notebook formatting consistency

Signed-off-by: Jocelyn Huang <[email protected]>

Signed-off-by: Jocelyn Huang <[email protected]>
Co-authored-by: Xuesong Yang <[email protected]>

* [TTS] bugfix for missing configs. (NVIDIA#4725)

Signed-off-by: Xuesong Yang <[email protected]>

* docs typo fix

Signed-off-by: Oleksii Kuchaiev <[email protected]>

* Fix pynini install in TTS tutorials (NVIDIA#4729)

Signed-off-by: Jocelyn Huang <[email protected]>

Signed-off-by: Jocelyn Huang <[email protected]>

* Fix ASR notebooks (NVIDIA#4738)

Signed-off-by: smajumdar <[email protected]>

Signed-off-by: smajumdar <[email protected]>

* Multilingual VAD model (NVIDIA#4734)

* add ngc link

Signed-off-by: fayejf <[email protected]>

* add tuned VAD config on ASR data

Signed-off-by: fayejf <[email protected]>

* yaml note

Signed-off-by: fayejf <[email protected]>

* update vad asr notebook with mVAD

Signed-off-by: fayejf <[email protected]>

* update vad infer config comment

Signed-off-by: fayejf <[email protected]>

* fix

Signed-off-by: fayejf <[email protected]>

* mvad sd config for ch109

Signed-off-by: fayejf <[email protected]>

* update sd readme

Signed-off-by: fayejf <[email protected]>

* add new mVAD model to doc

Signed-off-by: fayejf <[email protected]>

* style fix

Signed-off-by: fayejf <[email protected]>

* update sd tutorial with mVAD

Signed-off-by: fayejf <[email protected]>

* typo fix

Signed-off-by: fayejf <[email protected]>

Signed-off-by: fayejf <[email protected]>

* publish pretrained itn t5 model for English (NVIDIA#4748)

Signed-off-by: Alexandra Antonova <[email protected]>

Signed-off-by: Alexandra Antonova <[email protected]>
Co-authored-by: Alexandra Antonova <[email protected]>

* Updated docs and doc paths (NVIDIA#4754)

* Updated docs and doc paths

Signed-off-by: Virginia Adams <[email protected]>

* Update Multitask_Prompt_and_PTuning.ipynb

* Update README.rst

* Changed branch name to use single quotes

Signed-off-by: Virginia Adams <[email protected]>

Signed-off-by: Virginia Adams <[email protected]>

* fix bug relating to ddp strategy in joint intent slot classification tutorial (NVIDIA#4762)

* [TTS] updated config with a German IPA phoneme tokenizer (NVIDIA#4756)

* [TTS] added a German IPA phoneme tokenizer
* [TTS][ASR] enabled customized arguments for trimming the leading and trailing silence.
* [TTS] disabled spline interpolation for beta-binomial distribution. Let it generate align prior and save to disks. Use a new phoneme tokenizer.
* [TTS] use consistent spline interpolation with fastpitch checkpoint when generating mel-spectrograms for hifigan finetune.

Signed-off-by: Xuesong Yang <[email protected]>

* Update r1.11 to new heteronyms list (NVIDIA#4745)

* Update configs to new heteronyms list
* Remove old heteronyms list, add alt 'merchandise' pron to CMUdict
* Update remaining references to old heteronyms list

Signed-off-by: Jocelyn Huang <[email protected]>
Co-authored-by: Xuesong Yang <[email protected]>

* [TTS] Add multi-speaker German FastPitch and HiFiGAN NGC checkpoints (NVIDIA#4763)

Signed-off-by: Xuesong Yang <[email protected]>

Signed-off-by: Xuesong Yang <[email protected]>

* [TTS] Add single male speaker German FastPitch and HiFiGAN NGC checkpoints (NVIDIA#4770)

Signed-off-by: Xuesong Yang <[email protected]>

* Update CMUdict with more recent 0.7b entries (NVIDIA#4768)

Signed-off-by: Jocelyn Huang <[email protected]>

Signed-off-by: Jocelyn Huang <[email protected]>
Co-authored-by: Xuesong Yang <[email protected]>

* Install pynini in docker container (NVIDIA#4733)

Signed-off-by: Vladimir Bataev <[email protected]>

Signed-off-by: Vladimir Bataev <[email protected]>
Co-authored-by: Nithin Rao <[email protected]>
Co-authored-by: Xuesong Yang <[email protected]>
Co-authored-by: Eric Harper <[email protected]>

* Fix tutorial formatting (NVIDIA#4778)

Signed-off-by: Jocelyn Huang <[email protected]>

* [TTS] deprecated old scripts for ljspeech. (NVIDIA#4780)

* deprecated old scripts for ljspeech.
* removed relevent function calls in TTS docs.

Signed-off-by: Xuesong Yang <[email protected]>

* update branch

Signed-off-by: ericharper <[email protected]>

* update package info and requirements

Signed-off-by: ericharper <[email protected]>

* update container

Signed-off-by: ericharper <[email protected]>

* Update stragglers to new cmudict and heteronyms paths

Signed-off-by: Jocelyn Huang <[email protected]>

Signed-off-by: Oleksii Kuchaiev <[email protected]>
Signed-off-by: ericharper <[email protected]>
Signed-off-by: Jason <[email protected]>
Signed-off-by: Xuesong Yang <[email protected]>
Signed-off-by: MaximumEntropy <[email protected]>
Signed-off-by: nithinraok <[email protected]>
Signed-off-by: smajumdar <[email protected]>
Signed-off-by: Virginia Adams <[email protected]>
Signed-off-by: Jocelyn Huang <[email protected]>
Signed-off-by: fayejf <[email protected]>
Signed-off-by: Alexandra Antonova <[email protected]>
Signed-off-by: Vladimir Bataev <[email protected]>
Co-authored-by: Oleksii Kuchaiev <[email protected]>
Co-authored-by: Jason <[email protected]>
Co-authored-by: Xuesong Yang <[email protected]>
Co-authored-by: Rajesh Ilango <[email protected]>
Co-authored-by: Micha Livne <[email protected]>
Co-authored-by: Sandeep Subramanian <[email protected]>
Co-authored-by: Nithin Rao <[email protected]>
Co-authored-by: Somshubra Majumdar <[email protected]>
Co-authored-by: Virginia Adams <[email protected]>
Co-authored-by: Jocelyn <[email protected]>
Co-authored-by: fayejf <[email protected]>
Co-authored-by: bene-ges <[email protected]>
Co-authored-by: Alexandra Antonova <[email protected]>
Co-authored-by: Zhilin Wang <[email protected]>
Co-authored-by: Vladimir Bataev <[email protected]>
Signed-off-by: Anas Abou Allaban <[email protected]>
jubick1337 pushed a commit to jubick1337/NeMo that referenced this pull request Oct 3, 2022
* NeMo Megatron doc updates

Signed-off-by: Oleksii Kuchaiev <[email protected]>

* update branch

Signed-off-by: ericharper <[email protected]>

* update package info and dockerfile

Signed-off-by: ericharper <[email protected]>

* fix fastpitch export (NVIDIA#4676)

Signed-off-by: Jason <[email protected]>

* [TTS] fixed wrong pronunciations for r1.11. (NVIDIA#4677)

* [TTS] fixed wrong pronunciations.

Signed-off-by: Xuesong Yang <[email protected]>

* incremented the version number to 22.08 as @blisc suggested.

Signed-off-by: Xuesong Yang <[email protected]>

* correct cmudict versions in world-wide places.

Signed-off-by: Xuesong Yang <[email protected]>

* Fix for incorrect batch size issue while decoding (NVIDIA#4675)

Co-authored-by: Micha Livne <[email protected]>
Co-authored-by: Eric Harper <[email protected]>

* [TTS] incremented the version number to 22.08 in tutorials. (NVIDIA#4684)

* [TTS] incremented the version number to 22.08 in tutorials.

Signed-off-by: Xuesong Yang <[email protected]>

* Megatron encode function with RPE fix (NVIDIA#4692)

* Fix for RPE

Signed-off-by: MaximumEntropy <[email protected]>

* Style

Signed-off-by: MaximumEntropy <[email protected]>

* fix to fetch config file (NVIDIA#4699)

Signed-off-by: nithinraok <[email protected]>

* Fix notebook for buffered inference (NVIDIA#4703)

Signed-off-by: smajumdar <[email protected]>

* Prompt Learning Notebook Bug Fix (NVIDIA#4689)

* Added back dataset class list of dict input for generation in tutorial notebook

Signed-off-by: Virginia Adams <[email protected]>

* updated argument name for build dataset

Signed-off-by: Virginia Adams <[email protected]>

Signed-off-by: Virginia Adams <[email protected]>

* add psutils to mock imports (NVIDIA#4728)

Signed-off-by: ericharper <[email protected]>

Signed-off-by: ericharper <[email protected]>

* Update Aligner model and tutorial to add NGC checkpoint loading (NVIDIA#4714)

* Update Aligner model and tutorial to add NGC checkpoint loading

Signed-off-by: Jocelyn Huang <[email protected]>

* Fix pynini install for Aligner notebook, minor formatting fix for model

Signed-off-by: Jocelyn Huang <[email protected]>

* Aligner notebook formatting consistency

Signed-off-by: Jocelyn Huang <[email protected]>

Signed-off-by: Jocelyn Huang <[email protected]>
Co-authored-by: Xuesong Yang <[email protected]>

* [TTS] bugfix for missing configs. (NVIDIA#4725)

Signed-off-by: Xuesong Yang <[email protected]>

* docs typo fix

Signed-off-by: Oleksii Kuchaiev <[email protected]>

* Fix pynini install in TTS tutorials (NVIDIA#4729)

Signed-off-by: Jocelyn Huang <[email protected]>

Signed-off-by: Jocelyn Huang <[email protected]>

* Fix ASR notebooks (NVIDIA#4738)

Signed-off-by: smajumdar <[email protected]>

Signed-off-by: smajumdar <[email protected]>

* Multilingual VAD model (NVIDIA#4734)

* add ngc link

Signed-off-by: fayejf <[email protected]>

* add tuned VAD config on ASR data

Signed-off-by: fayejf <[email protected]>

* yaml note

Signed-off-by: fayejf <[email protected]>

* update vad asr notebook with mVAD

Signed-off-by: fayejf <[email protected]>

* update vad infer config comment

Signed-off-by: fayejf <[email protected]>

* fix

Signed-off-by: fayejf <[email protected]>

* mvad sd config for ch109

Signed-off-by: fayejf <[email protected]>

* update sd readme

Signed-off-by: fayejf <[email protected]>

* add new mVAD model to doc

Signed-off-by: fayejf <[email protected]>

* style fix

Signed-off-by: fayejf <[email protected]>

* update sd tutorial with mVAD

Signed-off-by: fayejf <[email protected]>

* typo fix

Signed-off-by: fayejf <[email protected]>

Signed-off-by: fayejf <[email protected]>

* publish pretrained itn t5 model for English (NVIDIA#4748)

Signed-off-by: Alexandra Antonova <[email protected]>

Signed-off-by: Alexandra Antonova <[email protected]>
Co-authored-by: Alexandra Antonova <[email protected]>

* Updated docs and doc paths (NVIDIA#4754)

* Updated docs and doc paths

Signed-off-by: Virginia Adams <[email protected]>

* Update Multitask_Prompt_and_PTuning.ipynb

* Update README.rst

* Changed branch name to use single quotes

Signed-off-by: Virginia Adams <[email protected]>

Signed-off-by: Virginia Adams <[email protected]>

* fix bug relating to ddp strategy in joint intent slot classification tutorial (NVIDIA#4762)

* [TTS] updated config with a German IPA phoneme tokenizer (NVIDIA#4756)

* [TTS] added a German IPA phoneme tokenizer
* [TTS][ASR] enabled customized arguments for trimming the leading and trailing silence.
* [TTS] disabled spline interpolation for beta-binomial distribution. Let it generate align prior and save to disks. Use a new phoneme tokenizer.
* [TTS] use consistent spline interpolation with fastpitch checkpoint when generating mel-spectrograms for hifigan finetune.

Signed-off-by: Xuesong Yang <[email protected]>

* Update r1.11 to new heteronyms list (NVIDIA#4745)

* Update configs to new heteronyms list
* Remove old heteronyms list, add alt 'merchandise' pron to CMUdict
* Update remaining references to old heteronyms list

Signed-off-by: Jocelyn Huang <[email protected]>
Co-authored-by: Xuesong Yang <[email protected]>

* [TTS] Add multi-speaker German FastPitch and HiFiGAN NGC checkpoints (NVIDIA#4763)

Signed-off-by: Xuesong Yang <[email protected]>

Signed-off-by: Xuesong Yang <[email protected]>

* [TTS] Add single male speaker German FastPitch and HiFiGAN NGC checkpoints (NVIDIA#4770)

Signed-off-by: Xuesong Yang <[email protected]>

* Update CMUdict with more recent 0.7b entries (NVIDIA#4768)

Signed-off-by: Jocelyn Huang <[email protected]>

Signed-off-by: Jocelyn Huang <[email protected]>
Co-authored-by: Xuesong Yang <[email protected]>

* Install pynini in docker container (NVIDIA#4733)

Signed-off-by: Vladimir Bataev <[email protected]>

Signed-off-by: Vladimir Bataev <[email protected]>
Co-authored-by: Nithin Rao <[email protected]>
Co-authored-by: Xuesong Yang <[email protected]>
Co-authored-by: Eric Harper <[email protected]>

* Fix tutorial formatting (NVIDIA#4778)

Signed-off-by: Jocelyn Huang <[email protected]>

* [TTS] deprecated old scripts for ljspeech. (NVIDIA#4780)

* deprecated old scripts for ljspeech.
* removed relevent function calls in TTS docs.

Signed-off-by: Xuesong Yang <[email protected]>

* update branch

Signed-off-by: ericharper <[email protected]>

* update package info and requirements

Signed-off-by: ericharper <[email protected]>

* update container

Signed-off-by: ericharper <[email protected]>

* Update stragglers to new cmudict and heteronyms paths

Signed-off-by: Jocelyn Huang <[email protected]>

Signed-off-by: Oleksii Kuchaiev <[email protected]>
Signed-off-by: ericharper <[email protected]>
Signed-off-by: Jason <[email protected]>
Signed-off-by: Xuesong Yang <[email protected]>
Signed-off-by: MaximumEntropy <[email protected]>
Signed-off-by: nithinraok <[email protected]>
Signed-off-by: smajumdar <[email protected]>
Signed-off-by: Virginia Adams <[email protected]>
Signed-off-by: Jocelyn Huang <[email protected]>
Signed-off-by: fayejf <[email protected]>
Signed-off-by: Alexandra Antonova <[email protected]>
Signed-off-by: Vladimir Bataev <[email protected]>
Co-authored-by: Oleksii Kuchaiev <[email protected]>
Co-authored-by: Jason <[email protected]>
Co-authored-by: Xuesong Yang <[email protected]>
Co-authored-by: Rajesh Ilango <[email protected]>
Co-authored-by: Micha Livne <[email protected]>
Co-authored-by: Sandeep Subramanian <[email protected]>
Co-authored-by: Nithin Rao <[email protected]>
Co-authored-by: Somshubra Majumdar <[email protected]>
Co-authored-by: Virginia Adams <[email protected]>
Co-authored-by: Jocelyn <[email protected]>
Co-authored-by: fayejf <[email protected]>
Co-authored-by: bene-ges <[email protected]>
Co-authored-by: Alexandra Antonova <[email protected]>
Co-authored-by: Zhilin Wang <[email protected]>
Co-authored-by: Vladimir Bataev <[email protected]>
Signed-off-by: Matvei Novikov <[email protected]>
hainan-xv pushed a commit to hainan-xv/NeMo that referenced this pull request Nov 29, 2022
* NeMo Megatron doc updates

Signed-off-by: Oleksii Kuchaiev <[email protected]>

* update branch

Signed-off-by: ericharper <[email protected]>

* update package info and dockerfile

Signed-off-by: ericharper <[email protected]>

* fix fastpitch export (NVIDIA#4676)

Signed-off-by: Jason <[email protected]>

* [TTS] fixed wrong pronunciations for r1.11. (NVIDIA#4677)

* [TTS] fixed wrong pronunciations.

Signed-off-by: Xuesong Yang <[email protected]>

* incremented the version number to 22.08 as @blisc suggested.

Signed-off-by: Xuesong Yang <[email protected]>

* correct cmudict versions in world-wide places.

Signed-off-by: Xuesong Yang <[email protected]>

* Fix for incorrect batch size issue while decoding (NVIDIA#4675)

Co-authored-by: Micha Livne <[email protected]>
Co-authored-by: Eric Harper <[email protected]>

* [TTS] incremented the version number to 22.08 in tutorials. (NVIDIA#4684)

* [TTS] incremented the version number to 22.08 in tutorials.

Signed-off-by: Xuesong Yang <[email protected]>

* Megatron encode function with RPE fix (NVIDIA#4692)

* Fix for RPE

Signed-off-by: MaximumEntropy <[email protected]>

* Style

Signed-off-by: MaximumEntropy <[email protected]>

* fix to fetch config file (NVIDIA#4699)

Signed-off-by: nithinraok <[email protected]>

* Fix notebook for buffered inference (NVIDIA#4703)

Signed-off-by: smajumdar <[email protected]>

* Prompt Learning Notebook Bug Fix (NVIDIA#4689)

* Added back dataset class list of dict input for generation in tutorial notebook

Signed-off-by: Virginia Adams <[email protected]>

* updated argument name for build dataset

Signed-off-by: Virginia Adams <[email protected]>

Signed-off-by: Virginia Adams <[email protected]>

* add psutils to mock imports (NVIDIA#4728)

Signed-off-by: ericharper <[email protected]>

Signed-off-by: ericharper <[email protected]>

* Update Aligner model and tutorial to add NGC checkpoint loading (NVIDIA#4714)

* Update Aligner model and tutorial to add NGC checkpoint loading

Signed-off-by: Jocelyn Huang <[email protected]>

* Fix pynini install for Aligner notebook, minor formatting fix for model

Signed-off-by: Jocelyn Huang <[email protected]>

* Aligner notebook formatting consistency

Signed-off-by: Jocelyn Huang <[email protected]>

Signed-off-by: Jocelyn Huang <[email protected]>
Co-authored-by: Xuesong Yang <[email protected]>

* [TTS] bugfix for missing configs. (NVIDIA#4725)

Signed-off-by: Xuesong Yang <[email protected]>

* docs typo fix

Signed-off-by: Oleksii Kuchaiev <[email protected]>

* Fix pynini install in TTS tutorials (NVIDIA#4729)

Signed-off-by: Jocelyn Huang <[email protected]>

Signed-off-by: Jocelyn Huang <[email protected]>

* Fix ASR notebooks (NVIDIA#4738)

Signed-off-by: smajumdar <[email protected]>

Signed-off-by: smajumdar <[email protected]>

* Multilingual VAD model (NVIDIA#4734)

* add ngc link

Signed-off-by: fayejf <[email protected]>

* add tuned VAD config on ASR data

Signed-off-by: fayejf <[email protected]>

* yaml note

Signed-off-by: fayejf <[email protected]>

* update vad asr notebook with mVAD

Signed-off-by: fayejf <[email protected]>

* update vad infer config comment

Signed-off-by: fayejf <[email protected]>

* fix

Signed-off-by: fayejf <[email protected]>

* mvad sd config for ch109

Signed-off-by: fayejf <[email protected]>

* update sd readme

Signed-off-by: fayejf <[email protected]>

* add new mVAD model to doc

Signed-off-by: fayejf <[email protected]>

* style fix

Signed-off-by: fayejf <[email protected]>

* update sd tutorial with mVAD

Signed-off-by: fayejf <[email protected]>

* typo fix

Signed-off-by: fayejf <[email protected]>

Signed-off-by: fayejf <[email protected]>

* publish pretrained itn t5 model for English (NVIDIA#4748)

Signed-off-by: Alexandra Antonova <[email protected]>

Signed-off-by: Alexandra Antonova <[email protected]>
Co-authored-by: Alexandra Antonova <[email protected]>

* Updated docs and doc paths (NVIDIA#4754)

* Updated docs and doc paths

Signed-off-by: Virginia Adams <[email protected]>

* Update Multitask_Prompt_and_PTuning.ipynb

* Update README.rst

* Changed branch name to use single quotes

Signed-off-by: Virginia Adams <[email protected]>

Signed-off-by: Virginia Adams <[email protected]>

* fix bug relating to ddp strategy in joint intent slot classification tutorial (NVIDIA#4762)

* [TTS] updated config with a German IPA phoneme tokenizer (NVIDIA#4756)

* [TTS] added a German IPA phoneme tokenizer
* [TTS][ASR] enabled customized arguments for trimming the leading and trailing silence.
* [TTS] disabled spline interpolation for beta-binomial distribution. Let it generate align prior and save to disks. Use a new phoneme tokenizer.
* [TTS] use consistent spline interpolation with fastpitch checkpoint when generating mel-spectrograms for hifigan finetune.

Signed-off-by: Xuesong Yang <[email protected]>

* Update r1.11 to new heteronyms list (NVIDIA#4745)

* Update configs to new heteronyms list
* Remove old heteronyms list, add alt 'merchandise' pron to CMUdict
* Update remaining references to old heteronyms list

Signed-off-by: Jocelyn Huang <[email protected]>
Co-authored-by: Xuesong Yang <[email protected]>

* [TTS] Add multi-speaker German FastPitch and HiFiGAN NGC checkpoints (NVIDIA#4763)

Signed-off-by: Xuesong Yang <[email protected]>

Signed-off-by: Xuesong Yang <[email protected]>

* [TTS] Add single male speaker German FastPitch and HiFiGAN NGC checkpoints (NVIDIA#4770)

Signed-off-by: Xuesong Yang <[email protected]>

* Update CMUdict with more recent 0.7b entries (NVIDIA#4768)

Signed-off-by: Jocelyn Huang <[email protected]>

Signed-off-by: Jocelyn Huang <[email protected]>
Co-authored-by: Xuesong Yang <[email protected]>

* Install pynini in docker container (NVIDIA#4733)

Signed-off-by: Vladimir Bataev <[email protected]>

Signed-off-by: Vladimir Bataev <[email protected]>
Co-authored-by: Nithin Rao <[email protected]>
Co-authored-by: Xuesong Yang <[email protected]>
Co-authored-by: Eric Harper <[email protected]>

* Fix tutorial formatting (NVIDIA#4778)

Signed-off-by: Jocelyn Huang <[email protected]>

* [TTS] deprecated old scripts for ljspeech. (NVIDIA#4780)

* deprecated old scripts for ljspeech.
* removed relevent function calls in TTS docs.

Signed-off-by: Xuesong Yang <[email protected]>

* update branch

Signed-off-by: ericharper <[email protected]>

* update package info and requirements

Signed-off-by: ericharper <[email protected]>

* update container

Signed-off-by: ericharper <[email protected]>

* Update stragglers to new cmudict and heteronyms paths

Signed-off-by: Jocelyn Huang <[email protected]>

Signed-off-by: Oleksii Kuchaiev <[email protected]>
Signed-off-by: ericharper <[email protected]>
Signed-off-by: Jason <[email protected]>
Signed-off-by: Xuesong Yang <[email protected]>
Signed-off-by: MaximumEntropy <[email protected]>
Signed-off-by: nithinraok <[email protected]>
Signed-off-by: smajumdar <[email protected]>
Signed-off-by: Virginia Adams <[email protected]>
Signed-off-by: Jocelyn Huang <[email protected]>
Signed-off-by: fayejf <[email protected]>
Signed-off-by: Alexandra Antonova <[email protected]>
Signed-off-by: Vladimir Bataev <[email protected]>
Co-authored-by: Oleksii Kuchaiev <[email protected]>
Co-authored-by: Jason <[email protected]>
Co-authored-by: Xuesong Yang <[email protected]>
Co-authored-by: Rajesh Ilango <[email protected]>
Co-authored-by: Micha Livne <[email protected]>
Co-authored-by: Sandeep Subramanian <[email protected]>
Co-authored-by: Nithin Rao <[email protected]>
Co-authored-by: Somshubra Majumdar <[email protected]>
Co-authored-by: Virginia Adams <[email protected]>
Co-authored-by: Jocelyn <[email protected]>
Co-authored-by: fayejf <[email protected]>
Co-authored-by: bene-ges <[email protected]>
Co-authored-by: Alexandra Antonova <[email protected]>
Co-authored-by: Zhilin Wang <[email protected]>
Co-authored-by: Vladimir Bataev <[email protected]>
Signed-off-by: Hainan Xu <[email protected]>
hainan-xv pushed a commit to hainan-xv/NeMo that referenced this pull request Nov 29, 2022
* NeMo Megatron doc updates

Signed-off-by: Oleksii Kuchaiev <[email protected]>

* update branch

Signed-off-by: ericharper <[email protected]>

* update package info and dockerfile

Signed-off-by: ericharper <[email protected]>

* fix fastpitch export (NVIDIA#4676)

Signed-off-by: Jason <[email protected]>

* [TTS] fixed wrong pronunciations for r1.11. (NVIDIA#4677)

* [TTS] fixed wrong pronunciations.

Signed-off-by: Xuesong Yang <[email protected]>

* incremented the version number to 22.08 as @blisc suggested.

Signed-off-by: Xuesong Yang <[email protected]>

* correct cmudict versions in world-wide places.

Signed-off-by: Xuesong Yang <[email protected]>

* Fix for incorrect batch size issue while decoding (NVIDIA#4675)

Co-authored-by: Micha Livne <[email protected]>
Co-authored-by: Eric Harper <[email protected]>

* [TTS] incremented the version number to 22.08 in tutorials. (NVIDIA#4684)

* [TTS] incremented the version number to 22.08 in tutorials.

Signed-off-by: Xuesong Yang <[email protected]>

* Megatron encode function with RPE fix (NVIDIA#4692)

* Fix for RPE

Signed-off-by: MaximumEntropy <[email protected]>

* Style

Signed-off-by: MaximumEntropy <[email protected]>

* fix to fetch config file (NVIDIA#4699)

Signed-off-by: nithinraok <[email protected]>

* Fix notebook for buffered inference (NVIDIA#4703)

Signed-off-by: smajumdar <[email protected]>

* Prompt Learning Notebook Bug Fix (NVIDIA#4689)

* Added back dataset class list of dict input for generation in tutorial notebook

Signed-off-by: Virginia Adams <[email protected]>

* updated argument name for build dataset

Signed-off-by: Virginia Adams <[email protected]>

Signed-off-by: Virginia Adams <[email protected]>

* add psutils to mock imports (NVIDIA#4728)

Signed-off-by: ericharper <[email protected]>

Signed-off-by: ericharper <[email protected]>

* Update Aligner model and tutorial to add NGC checkpoint loading (NVIDIA#4714)

* Update Aligner model and tutorial to add NGC checkpoint loading

Signed-off-by: Jocelyn Huang <[email protected]>

* Fix pynini install for Aligner notebook, minor formatting fix for model

Signed-off-by: Jocelyn Huang <[email protected]>

* Aligner notebook formatting consistency

Signed-off-by: Jocelyn Huang <[email protected]>

Signed-off-by: Jocelyn Huang <[email protected]>
Co-authored-by: Xuesong Yang <[email protected]>

* [TTS] bugfix for missing configs. (NVIDIA#4725)

Signed-off-by: Xuesong Yang <[email protected]>

* docs typo fix

Signed-off-by: Oleksii Kuchaiev <[email protected]>

* Fix pynini install in TTS tutorials (NVIDIA#4729)

Signed-off-by: Jocelyn Huang <[email protected]>

Signed-off-by: Jocelyn Huang <[email protected]>

* Fix ASR notebooks (NVIDIA#4738)

Signed-off-by: smajumdar <[email protected]>

Signed-off-by: smajumdar <[email protected]>

* Multilingual VAD model (NVIDIA#4734)

* add ngc link

Signed-off-by: fayejf <[email protected]>

* add tuned VAD config on ASR data

Signed-off-by: fayejf <[email protected]>

* yaml note

Signed-off-by: fayejf <[email protected]>

* update vad asr notebook with mVAD

Signed-off-by: fayejf <[email protected]>

* update vad infer config comment

Signed-off-by: fayejf <[email protected]>

* fix

Signed-off-by: fayejf <[email protected]>

* mvad sd config for ch109

Signed-off-by: fayejf <[email protected]>

* update sd readme

Signed-off-by: fayejf <[email protected]>

* add new mVAD model to doc

Signed-off-by: fayejf <[email protected]>

* style fix

Signed-off-by: fayejf <[email protected]>

* update sd tutorial with mVAD

Signed-off-by: fayejf <[email protected]>

* typo fix

Signed-off-by: fayejf <[email protected]>

Signed-off-by: fayejf <[email protected]>

* publish pretrained itn t5 model for English (NVIDIA#4748)

Signed-off-by: Alexandra Antonova <[email protected]>

Signed-off-by: Alexandra Antonova <[email protected]>
Co-authored-by: Alexandra Antonova <[email protected]>

* Updated docs and doc paths (NVIDIA#4754)

* Updated docs and doc paths

Signed-off-by: Virginia Adams <[email protected]>

* Update Multitask_Prompt_and_PTuning.ipynb

* Update README.rst

* Changed branch name to use single quotes

Signed-off-by: Virginia Adams <[email protected]>

Signed-off-by: Virginia Adams <[email protected]>

* fix bug relating to ddp strategy in joint intent slot classification tutorial (NVIDIA#4762)

* [TTS] updated config with a German IPA phoneme tokenizer (NVIDIA#4756)

* [TTS] added a German IPA phoneme tokenizer
* [TTS][ASR] enabled customized arguments for trimming the leading and trailing silence.
* [TTS] disabled spline interpolation for beta-binomial distribution. Let it generate align prior and save to disks. Use a new phoneme tokenizer.
* [TTS] use consistent spline interpolation with fastpitch checkpoint when generating mel-spectrograms for hifigan finetune.

Signed-off-by: Xuesong Yang <[email protected]>

* Update r1.11 to new heteronyms list (NVIDIA#4745)

* Update configs to new heteronyms list
* Remove old heteronyms list, add alt 'merchandise' pron to CMUdict
* Update remaining references to old heteronyms list

Signed-off-by: Jocelyn Huang <[email protected]>
Co-authored-by: Xuesong Yang <[email protected]>

* [TTS] Add multi-speaker German FastPitch and HiFiGAN NGC checkpoints (NVIDIA#4763)

Signed-off-by: Xuesong Yang <[email protected]>

Signed-off-by: Xuesong Yang <[email protected]>

* [TTS] Add single male speaker German FastPitch and HiFiGAN NGC checkpoints (NVIDIA#4770)

Signed-off-by: Xuesong Yang <[email protected]>

* Update CMUdict with more recent 0.7b entries (NVIDIA#4768)

Signed-off-by: Jocelyn Huang <[email protected]>

Signed-off-by: Jocelyn Huang <[email protected]>
Co-authored-by: Xuesong Yang <[email protected]>

* Install pynini in docker container (NVIDIA#4733)

Signed-off-by: Vladimir Bataev <[email protected]>

Signed-off-by: Vladimir Bataev <[email protected]>
Co-authored-by: Nithin Rao <[email protected]>
Co-authored-by: Xuesong Yang <[email protected]>
Co-authored-by: Eric Harper <[email protected]>

* Fix tutorial formatting (NVIDIA#4778)

Signed-off-by: Jocelyn Huang <[email protected]>

* [TTS] deprecated old scripts for ljspeech. (NVIDIA#4780)

* deprecated old scripts for ljspeech.
* removed relevent function calls in TTS docs.

Signed-off-by: Xuesong Yang <[email protected]>

* update branch

Signed-off-by: ericharper <[email protected]>

* update package info and requirements

Signed-off-by: ericharper <[email protected]>

* update container

Signed-off-by: ericharper <[email protected]>

* Update stragglers to new cmudict and heteronyms paths

Signed-off-by: Jocelyn Huang <[email protected]>

Signed-off-by: Oleksii Kuchaiev <[email protected]>
Signed-off-by: ericharper <[email protected]>
Signed-off-by: Jason <[email protected]>
Signed-off-by: Xuesong Yang <[email protected]>
Signed-off-by: MaximumEntropy <[email protected]>
Signed-off-by: nithinraok <[email protected]>
Signed-off-by: smajumdar <[email protected]>
Signed-off-by: Virginia Adams <[email protected]>
Signed-off-by: Jocelyn Huang <[email protected]>
Signed-off-by: fayejf <[email protected]>
Signed-off-by: Alexandra Antonova <[email protected]>
Signed-off-by: Vladimir Bataev <[email protected]>
Co-authored-by: Oleksii Kuchaiev <[email protected]>
Co-authored-by: Jason <[email protected]>
Co-authored-by: Xuesong Yang <[email protected]>
Co-authored-by: Rajesh Ilango <[email protected]>
Co-authored-by: Micha Livne <[email protected]>
Co-authored-by: Sandeep Subramanian <[email protected]>
Co-authored-by: Nithin Rao <[email protected]>
Co-authored-by: Somshubra Majumdar <[email protected]>
Co-authored-by: Virginia Adams <[email protected]>
Co-authored-by: Jocelyn <[email protected]>
Co-authored-by: fayejf <[email protected]>
Co-authored-by: bene-ges <[email protected]>
Co-authored-by: Alexandra Antonova <[email protected]>
Co-authored-by: Zhilin Wang <[email protected]>
Co-authored-by: Vladimir Bataev <[email protected]>
Signed-off-by: Hainan Xu <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants