-
Notifications
You must be signed in to change notification settings - Fork 2.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Prompt learning of Huggingface T5v1.1 converted checkpoints #4746
Conversation
Signed-off-by: ericharper <[email protected]>
Signed-off-by: ericharper <[email protected]>
Signed-off-by: Jason <[email protected]>
* [TTS] fixed wrong pronunciations. Signed-off-by: Xuesong Yang <[email protected]> * incremented the version number to 22.08 as @blisc suggested. Signed-off-by: Xuesong Yang <[email protected]> * correct cmudict versions in world-wide places. Signed-off-by: Xuesong Yang <[email protected]>
Co-authored-by: Micha Livne <[email protected]> Co-authored-by: Eric Harper <[email protected]>
Signed-off-by: MaximumEntropy <[email protected]>
Signed-off-by: MaximumEntropy <[email protected]>
Signed-off-by: MaximumEntropy <[email protected]>
Signed-off-by: ericharper <[email protected]>
Signed-off-by: ericharper <[email protected]>
Signed-off-by: Jason <[email protected]>
* [TTS] fixed wrong pronunciations. Signed-off-by: Xuesong Yang <[email protected]> * incremented the version number to 22.08 as @blisc suggested. Signed-off-by: Xuesong Yang <[email protected]> * correct cmudict versions in world-wide places. Signed-off-by: Xuesong Yang <[email protected]>
Co-authored-by: Micha Livne <[email protected]> Co-authored-by: Eric Harper <[email protected]>
…Mo into megatron_t51_1_compat
Signed-off-by: MaximumEntropy <[email protected]>
Signed-off-by: MaximumEntropy <[email protected]>
Signed-off-by: MaximumEntropy <[email protected]>
Signed-off-by: MaximumEntropy <[email protected]>
Signed-off-by: MaximumEntropy <[email protected]>
Signed-off-by: MaximumEntropy <[email protected]>
Signed-off-by: MaximumEntropy <[email protected]>
Signed-off-by: MaximumEntropy <[email protected]>
Signed-off-by: MaximumEntropy <[email protected]>
Signed-off-by: Abhinav Khattar <[email protected]>
Signed-off-by: Abhinav Khattar <[email protected]>
Signed-off-by: MaximumEntropy <[email protected]>
Signed-off-by: MaximumEntropy <[email protected]>
Signed-off-by: MaximumEntropy <[email protected]>
Signed-off-by: MaximumEntropy <[email protected]>
Signed-off-by: MaximumEntropy <[email protected]>
Signed-off-by: MaximumEntropy <[email protected]>
This pull request introduces 3 alerts when merging a724c34 into a71712b - view on LGTM.com new alerts:
|
Signed-off-by: MaximumEntropy <[email protected]>
This pull request introduces 3 alerts when merging cd31acc into 0d5ed9b - view on LGTM.com new alerts:
|
Signed-off-by: MaximumEntropy <[email protected]>
This pull request introduces 3 alerts when merging 79543ea into cfd3682 - view on LGTM.com new alerts:
|
This pull request introduces 2 alerts and fixes 2 when merging ae46376 into bd377b7 - view on LGTM.com new alerts:
fixed alerts:
|
Signed-off-by: MaximumEntropy <[email protected]>
This pull request introduces 2 alerts and fixes 2 when merging ac9dcf6 into 4366699 - view on LGTM.com new alerts:
fixed alerts:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks great! Just left a few comments
nemo/collections/nlp/models/language_modeling/megatron_finetune_model.py
Outdated
Show resolved
Hide resolved
|
||
# Add BOS/EOS to the input of encoder if desired, adds EOS by default | ||
if self.ul2_prompt_token is not None: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should probably add the ul2_prompt_token
after the virtual prompt tokens if there are some at the beginning of the text input.
nemo/collections/nlp/models/language_modeling/megatron_t5_adapter_model.py
Show resolved
Hide resolved
'predicted_token_ids': processed_preds, | ||
'log_probs': log_probs, | ||
'labels': processed_labels, | ||
'input_text': input_text, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Some down stream code might expect these key values, we should check with @arendu that this doesn't break anything he is aware of.
Signed-off-by: MaximumEntropy <[email protected]>
for more information, see https://pre-commit.ci
This pull request introduces 3 alerts and fixes 2 when merging 203c67b into 4fc5385 - view on LGTM.com new alerts:
fixed alerts:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
) * update branch Signed-off-by: ericharper <[email protected]> * update package info and dockerfile Signed-off-by: ericharper <[email protected]> * fix fastpitch export (NVIDIA#4676) Signed-off-by: Jason <[email protected]> * [TTS] fixed wrong pronunciations for r1.11. (NVIDIA#4677) * [TTS] fixed wrong pronunciations. Signed-off-by: Xuesong Yang <[email protected]> * incremented the version number to 22.08 as @blisc suggested. Signed-off-by: Xuesong Yang <[email protected]> * correct cmudict versions in world-wide places. Signed-off-by: Xuesong Yang <[email protected]> * Fix for incorrect batch size issue while decoding (NVIDIA#4675) Co-authored-by: Micha Livne <[email protected]> Co-authored-by: Eric Harper <[email protected]> * Initial Signed-off-by: MaximumEntropy <[email protected]> * Fix for RPE Signed-off-by: MaximumEntropy <[email protected]> * Style Signed-off-by: MaximumEntropy <[email protected]> * update branch Signed-off-by: ericharper <[email protected]> * update package info and dockerfile Signed-off-by: ericharper <[email protected]> * fix fastpitch export (NVIDIA#4676) Signed-off-by: Jason <[email protected]> * [TTS] fixed wrong pronunciations for r1.11. (NVIDIA#4677) * [TTS] fixed wrong pronunciations. Signed-off-by: Xuesong Yang <[email protected]> * incremented the version number to 22.08 as @blisc suggested. Signed-off-by: Xuesong Yang <[email protected]> * correct cmudict versions in world-wide places. Signed-off-by: Xuesong Yang <[email protected]> * Fix for incorrect batch size issue while decoding (NVIDIA#4675) Co-authored-by: Micha Livne <[email protected]> Co-authored-by: Eric Harper <[email protected]> * Make megatron legacy configurable Signed-off-by: MaximumEntropy <[email protected]> * Enc-Dec checksum matching Signed-off-by: MaximumEntropy <[email protected]> * Add conversion script Signed-off-by: MaximumEntropy <[email protected]> * Reset files Signed-off-by: MaximumEntropy <[email protected]> * Reset docker and jenkinsfile Signed-off-by: MaximumEntropy <[email protected]> * Reset README Signed-off-by: MaximumEntropy <[email protected]> * Remove tts scripts files Signed-off-by: MaximumEntropy <[email protected]> * Style Signed-off-by: MaximumEntropy <[email protected]> * Update finetuning script Signed-off-by: MaximumEntropy <[email protected]> * add cloning Signed-off-by: Abhinav Khattar <[email protected]> * map to cpu Signed-off-by: Abhinav Khattar <[email protected]> * Fix TP change for HF exported models Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Update conversion script and style Signed-off-by: MaximumEntropy <[email protected]> * Add base config Signed-off-by: MaximumEntropy <[email protected]> * Add arg Signed-off-by: MaximumEntropy <[email protected]> * Change partition comment update Signed-off-by: MaximumEntropy <[email protected]> * Update base config Signed-off-by: MaximumEntropy <[email protected]> * Minor fix for prompt learning Signed-off-by: MaximumEntropy <[email protected]> * style Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix default Signed-off-by: MaximumEntropy <[email protected]> * Fix to latest ptl Signed-off-by: MaximumEntropy <[email protected]> * Add arg to perceiver Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Temporarily add Signed-off-by: MaximumEntropy <[email protected]> * Restore Signed-off-by: MaximumEntropy <[email protected]> * Move tokens head bias to cfg population Signed-off-by: MaximumEntropy <[email protected]> * Fixes Signed-off-by: MaximumEntropy <[email protected]> * Empty Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fixes to get decode to work. Signed-off-by: MaximumEntropy <[email protected]> * More changes Signed-off-by: MaximumEntropy <[email protected]> * Update base config Signed-off-by: MaximumEntropy <[email protected]> * Test Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Update config to 0 dropout Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Reset file Signed-off-by: MaximumEntropy <[email protected]> * Remove scheduler Signed-off-by: MaximumEntropy <[email protected]> * Changes Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Support generic bos id Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Minor Signed-off-by: MaximumEntropy <[email protected]> * Minor Signed-off-by: MaximumEntropy <[email protected]> * Fixes Signed-off-by: MaximumEntropy <[email protected]> * Minor changes Signed-off-by: MaximumEntropy <[email protected]> * Add embedding dropout Signed-off-by: MaximumEntropy <[email protected]> * Changes for ul2 Signed-off-by: MaximumEntropy <[email protected]> * Fix for pad id Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Update models that can be converted Signed-off-by: MaximumEntropy <[email protected]> * Fix inference Signed-off-by: MaximumEntropy <[email protected]> * Remove ipdb Signed-off-by: MaximumEntropy <[email protected]> * Fix typo Signed-off-by: MaximumEntropy <[email protected]> * Load ul2 in bf16 Signed-off-by: MaximumEntropy <[email protected]> * Add amp o2 arg Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Tmp Signed-off-by: MaximumEntropy <[email protected]> * Fix rmsnorm Signed-off-by: MaximumEntropy <[email protected]> * Reset config Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix eval for converted models Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Style Signed-off-by: MaximumEntropy <[email protected]> * Update predict step for adapters Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Minor Signed-off-by: MaximumEntropy <[email protected]> * Fixes Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: ericharper <[email protected]> Signed-off-by: Jason <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Abhinav Khattar <[email protected]> Co-authored-by: ericharper <[email protected]> Co-authored-by: Jason <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: Rajesh Ilango <[email protected]> Co-authored-by: Micha Livne <[email protected]> Co-authored-by: Abhinav Khattar <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: 1-800-bad-code <[email protected]>
) * update branch Signed-off-by: ericharper <[email protected]> * update package info and dockerfile Signed-off-by: ericharper <[email protected]> * fix fastpitch export (NVIDIA#4676) Signed-off-by: Jason <[email protected]> * [TTS] fixed wrong pronunciations for r1.11. (NVIDIA#4677) * [TTS] fixed wrong pronunciations. Signed-off-by: Xuesong Yang <[email protected]> * incremented the version number to 22.08 as @blisc suggested. Signed-off-by: Xuesong Yang <[email protected]> * correct cmudict versions in world-wide places. Signed-off-by: Xuesong Yang <[email protected]> * Fix for incorrect batch size issue while decoding (NVIDIA#4675) Co-authored-by: Micha Livne <[email protected]> Co-authored-by: Eric Harper <[email protected]> * Initial Signed-off-by: MaximumEntropy <[email protected]> * Fix for RPE Signed-off-by: MaximumEntropy <[email protected]> * Style Signed-off-by: MaximumEntropy <[email protected]> * update branch Signed-off-by: ericharper <[email protected]> * update package info and dockerfile Signed-off-by: ericharper <[email protected]> * fix fastpitch export (NVIDIA#4676) Signed-off-by: Jason <[email protected]> * [TTS] fixed wrong pronunciations for r1.11. (NVIDIA#4677) * [TTS] fixed wrong pronunciations. Signed-off-by: Xuesong Yang <[email protected]> * incremented the version number to 22.08 as @blisc suggested. Signed-off-by: Xuesong Yang <[email protected]> * correct cmudict versions in world-wide places. Signed-off-by: Xuesong Yang <[email protected]> * Fix for incorrect batch size issue while decoding (NVIDIA#4675) Co-authored-by: Micha Livne <[email protected]> Co-authored-by: Eric Harper <[email protected]> * Make megatron legacy configurable Signed-off-by: MaximumEntropy <[email protected]> * Enc-Dec checksum matching Signed-off-by: MaximumEntropy <[email protected]> * Add conversion script Signed-off-by: MaximumEntropy <[email protected]> * Reset files Signed-off-by: MaximumEntropy <[email protected]> * Reset docker and jenkinsfile Signed-off-by: MaximumEntropy <[email protected]> * Reset README Signed-off-by: MaximumEntropy <[email protected]> * Remove tts scripts files Signed-off-by: MaximumEntropy <[email protected]> * Style Signed-off-by: MaximumEntropy <[email protected]> * Update finetuning script Signed-off-by: MaximumEntropy <[email protected]> * add cloning Signed-off-by: Abhinav Khattar <[email protected]> * map to cpu Signed-off-by: Abhinav Khattar <[email protected]> * Fix TP change for HF exported models Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Update conversion script and style Signed-off-by: MaximumEntropy <[email protected]> * Add base config Signed-off-by: MaximumEntropy <[email protected]> * Add arg Signed-off-by: MaximumEntropy <[email protected]> * Change partition comment update Signed-off-by: MaximumEntropy <[email protected]> * Update base config Signed-off-by: MaximumEntropy <[email protected]> * Minor fix for prompt learning Signed-off-by: MaximumEntropy <[email protected]> * style Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix default Signed-off-by: MaximumEntropy <[email protected]> * Fix to latest ptl Signed-off-by: MaximumEntropy <[email protected]> * Add arg to perceiver Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Temporarily add Signed-off-by: MaximumEntropy <[email protected]> * Restore Signed-off-by: MaximumEntropy <[email protected]> * Move tokens head bias to cfg population Signed-off-by: MaximumEntropy <[email protected]> * Fixes Signed-off-by: MaximumEntropy <[email protected]> * Empty Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fixes to get decode to work. Signed-off-by: MaximumEntropy <[email protected]> * More changes Signed-off-by: MaximumEntropy <[email protected]> * Update base config Signed-off-by: MaximumEntropy <[email protected]> * Test Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Update config to 0 dropout Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Reset file Signed-off-by: MaximumEntropy <[email protected]> * Remove scheduler Signed-off-by: MaximumEntropy <[email protected]> * Changes Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Support generic bos id Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Minor Signed-off-by: MaximumEntropy <[email protected]> * Minor Signed-off-by: MaximumEntropy <[email protected]> * Fixes Signed-off-by: MaximumEntropy <[email protected]> * Minor changes Signed-off-by: MaximumEntropy <[email protected]> * Add embedding dropout Signed-off-by: MaximumEntropy <[email protected]> * Changes for ul2 Signed-off-by: MaximumEntropy <[email protected]> * Fix for pad id Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Update models that can be converted Signed-off-by: MaximumEntropy <[email protected]> * Fix inference Signed-off-by: MaximumEntropy <[email protected]> * Remove ipdb Signed-off-by: MaximumEntropy <[email protected]> * Fix typo Signed-off-by: MaximumEntropy <[email protected]> * Load ul2 in bf16 Signed-off-by: MaximumEntropy <[email protected]> * Add amp o2 arg Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Tmp Signed-off-by: MaximumEntropy <[email protected]> * Fix rmsnorm Signed-off-by: MaximumEntropy <[email protected]> * Reset config Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix eval for converted models Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Style Signed-off-by: MaximumEntropy <[email protected]> * Update predict step for adapters Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Minor Signed-off-by: MaximumEntropy <[email protected]> * Fixes Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: ericharper <[email protected]> Signed-off-by: Jason <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Abhinav Khattar <[email protected]> Co-authored-by: ericharper <[email protected]> Co-authored-by: Jason <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: Rajesh Ilango <[email protected]> Co-authored-by: Micha Livne <[email protected]> Co-authored-by: Abhinav Khattar <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Hainan Xu <[email protected]>
) * update branch Signed-off-by: ericharper <[email protected]> * update package info and dockerfile Signed-off-by: ericharper <[email protected]> * fix fastpitch export (NVIDIA#4676) Signed-off-by: Jason <[email protected]> * [TTS] fixed wrong pronunciations for r1.11. (NVIDIA#4677) * [TTS] fixed wrong pronunciations. Signed-off-by: Xuesong Yang <[email protected]> * incremented the version number to 22.08 as @blisc suggested. Signed-off-by: Xuesong Yang <[email protected]> * correct cmudict versions in world-wide places. Signed-off-by: Xuesong Yang <[email protected]> * Fix for incorrect batch size issue while decoding (NVIDIA#4675) Co-authored-by: Micha Livne <[email protected]> Co-authored-by: Eric Harper <[email protected]> * Initial Signed-off-by: MaximumEntropy <[email protected]> * Fix for RPE Signed-off-by: MaximumEntropy <[email protected]> * Style Signed-off-by: MaximumEntropy <[email protected]> * update branch Signed-off-by: ericharper <[email protected]> * update package info and dockerfile Signed-off-by: ericharper <[email protected]> * fix fastpitch export (NVIDIA#4676) Signed-off-by: Jason <[email protected]> * [TTS] fixed wrong pronunciations for r1.11. (NVIDIA#4677) * [TTS] fixed wrong pronunciations. Signed-off-by: Xuesong Yang <[email protected]> * incremented the version number to 22.08 as @blisc suggested. Signed-off-by: Xuesong Yang <[email protected]> * correct cmudict versions in world-wide places. Signed-off-by: Xuesong Yang <[email protected]> * Fix for incorrect batch size issue while decoding (NVIDIA#4675) Co-authored-by: Micha Livne <[email protected]> Co-authored-by: Eric Harper <[email protected]> * Make megatron legacy configurable Signed-off-by: MaximumEntropy <[email protected]> * Enc-Dec checksum matching Signed-off-by: MaximumEntropy <[email protected]> * Add conversion script Signed-off-by: MaximumEntropy <[email protected]> * Reset files Signed-off-by: MaximumEntropy <[email protected]> * Reset docker and jenkinsfile Signed-off-by: MaximumEntropy <[email protected]> * Reset README Signed-off-by: MaximumEntropy <[email protected]> * Remove tts scripts files Signed-off-by: MaximumEntropy <[email protected]> * Style Signed-off-by: MaximumEntropy <[email protected]> * Update finetuning script Signed-off-by: MaximumEntropy <[email protected]> * add cloning Signed-off-by: Abhinav Khattar <[email protected]> * map to cpu Signed-off-by: Abhinav Khattar <[email protected]> * Fix TP change for HF exported models Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Update conversion script and style Signed-off-by: MaximumEntropy <[email protected]> * Add base config Signed-off-by: MaximumEntropy <[email protected]> * Add arg Signed-off-by: MaximumEntropy <[email protected]> * Change partition comment update Signed-off-by: MaximumEntropy <[email protected]> * Update base config Signed-off-by: MaximumEntropy <[email protected]> * Minor fix for prompt learning Signed-off-by: MaximumEntropy <[email protected]> * style Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix default Signed-off-by: MaximumEntropy <[email protected]> * Fix to latest ptl Signed-off-by: MaximumEntropy <[email protected]> * Add arg to perceiver Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Temporarily add Signed-off-by: MaximumEntropy <[email protected]> * Restore Signed-off-by: MaximumEntropy <[email protected]> * Move tokens head bias to cfg population Signed-off-by: MaximumEntropy <[email protected]> * Fixes Signed-off-by: MaximumEntropy <[email protected]> * Empty Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fixes to get decode to work. Signed-off-by: MaximumEntropy <[email protected]> * More changes Signed-off-by: MaximumEntropy <[email protected]> * Update base config Signed-off-by: MaximumEntropy <[email protected]> * Test Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Update config to 0 dropout Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Reset file Signed-off-by: MaximumEntropy <[email protected]> * Remove scheduler Signed-off-by: MaximumEntropy <[email protected]> * Changes Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Support generic bos id Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Minor Signed-off-by: MaximumEntropy <[email protected]> * Minor Signed-off-by: MaximumEntropy <[email protected]> * Fixes Signed-off-by: MaximumEntropy <[email protected]> * Minor changes Signed-off-by: MaximumEntropy <[email protected]> * Add embedding dropout Signed-off-by: MaximumEntropy <[email protected]> * Changes for ul2 Signed-off-by: MaximumEntropy <[email protected]> * Fix for pad id Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Update models that can be converted Signed-off-by: MaximumEntropy <[email protected]> * Fix inference Signed-off-by: MaximumEntropy <[email protected]> * Remove ipdb Signed-off-by: MaximumEntropy <[email protected]> * Fix typo Signed-off-by: MaximumEntropy <[email protected]> * Load ul2 in bf16 Signed-off-by: MaximumEntropy <[email protected]> * Add amp o2 arg Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Tmp Signed-off-by: MaximumEntropy <[email protected]> * Fix rmsnorm Signed-off-by: MaximumEntropy <[email protected]> * Reset config Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix eval for converted models Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Style Signed-off-by: MaximumEntropy <[email protected]> * Update predict step for adapters Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Minor Signed-off-by: MaximumEntropy <[email protected]> * Fixes Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: ericharper <[email protected]> Signed-off-by: Jason <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Abhinav Khattar <[email protected]> Co-authored-by: ericharper <[email protected]> Co-authored-by: Jason <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: Rajesh Ilango <[email protected]> Co-authored-by: Micha Livne <[email protected]> Co-authored-by: Abhinav Khattar <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Hainan Xu <[email protected]>
What does this PR do ?
Makes the changes necessary to do prompt learning of T5v1.1-converted checkpoints from HF.
Collection: NLP
Changelog
Usage
# Add a code snippet demonstrating how to use this
Before your PR is "Ready for review"
Pre checks:
PR Type:
If you haven't finished some of the above items you can still open "Draft" PR.
Who can review?
Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.
Additional Information