Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Explicitly check for united embeddings when logging params #6085

Merged
merged 2 commits into from
Feb 26, 2023

Conversation

MaximumEntropy
Copy link
Contributor

What does this PR do ?

Fixes a bug with logging params with PP > 1.

Collection: NLP

Changelog

  • Add specific line by line info of high level changes in this PR.

Usage

  • You can potentially add a usage example below
# Add a code snippet demonstrating how to use this 

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you add or update any necessary documentation?
  • Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
    • Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

  • New Feature
  • Bugfix
  • Documentation

If you haven't finished some of the above items you can still open "Draft" PR.

Who can review?

Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.

Additional Information

  • Related to # (issue)

@MaximumEntropy MaximumEntropy added bug Something isn't working NLP labels Feb 22, 2023
Copy link
Collaborator

@yidong72 yidong72 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM.

@MaximumEntropy MaximumEntropy merged commit aef821e into main Feb 26, 2023
@MaximumEntropy MaximumEntropy deleted the sandeepsub/fix_gpt_param_logging branch February 26, 2023 02:47
MaximumEntropy added a commit that referenced this pull request Mar 4, 2023
* Explicitly check for united embeddings

Signed-off-by: MaximumEntropy <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: MaximumEntropy <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
barry-jin pushed a commit to barry-jin/NeMo that referenced this pull request Mar 8, 2023
* Explicitly check for united embeddings

Signed-off-by: MaximumEntropy <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: MaximumEntropy <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
titu1994 pushed a commit to titu1994/NeMo that referenced this pull request Mar 24, 2023
* Explicitly check for united embeddings

Signed-off-by: MaximumEntropy <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: MaximumEntropy <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
ericharper added a commit that referenced this pull request Apr 6, 2023
* copy from sft_from_gpt

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Changed tokenization and example

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* maybe remove (got from upstream)

* Eval metrics while finetuning

Signed-off-by: MaximumEntropy <[email protected]>

* Add missing args

Signed-off-by: MaximumEntropy <[email protected]>

* Add arg

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Wrap in try except

Signed-off-by: MaximumEntropy <[email protected]>

* Try fix

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Add separate validation and test batch sizes

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Add assert

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Fix checkpoint name

Signed-off-by: MaximumEntropy <[email protected]>

* Explict sampling args

Signed-off-by: MaximumEntropy <[email protected]>

* Update t0 script

Signed-off-by: MaximumEntropy <[email protected]>

* Add niv2 script

Signed-off-by: MaximumEntropy <[email protected]>

* Change workers

Signed-off-by: MaximumEntropy <[email protected]>

* Fix labels

Signed-off-by: MaximumEntropy <[email protected]>

* Ignore download

Signed-off-by: MaximumEntropy <[email protected]>

* Minor fixes

Signed-off-by: MaximumEntropy <[email protected]>

* Add dist opt support

Signed-off-by: MaximumEntropy <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Minor

Signed-off-by: MaximumEntropy <[email protected]>

* Allow skipping validation

Signed-off-by: MaximumEntropy <[email protected]>

* Fix tokenization and padding to max batch

Signed-off-by: MaximumEntropy <[email protected]>

* Adds several configurable flags for Megatron GPT models (#5991)

* Initial

Signed-off-by: MaximumEntropy <[email protected]>

* Multiple fixes

Signed-off-by: MaximumEntropy <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Add to CI test

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* check position embs for gpt prompt learning

Signed-off-by: Adi Renduchintala <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Update args

Signed-off-by: MaximumEntropy <[email protected]>

* Disable tts unit test

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Empty

Signed-off-by: MaximumEntropy <[email protected]>

* Update Jenkinsfile

Changed optimizer for GPT training from 'fused_adam' to 'distributed_fused_adam'.

Signed-off-by: khcs <[email protected]>

* update config to to use correct key

Signed-off-by: ericharper <[email protected]>

* revert Jenkinsfile back to fused_adam

Signed-off-by: ericharper <[email protected]>

---------

Signed-off-by: MaximumEntropy <[email protected]>
Signed-off-by: Adi Renduchintala <[email protected]>
Signed-off-by: khcs <[email protected]>
Signed-off-by: ericharper <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adi Renduchintala <[email protected]>
Co-authored-by: khcs <[email protected]>
Co-authored-by: Oleksii Kuchaiev <[email protected]>
Co-authored-by: ericharper <[email protected]>

* Fast glu activations (#6058)

* fast glu activations

Signed-off-by: MaximumEntropy <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Clean up activation list

Signed-off-by: MaximumEntropy <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: MaximumEntropy <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

* Explicitly check for united embeddings when logging params (#6085)

* Explicitly check for united embeddings

Signed-off-by: MaximumEntropy <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: MaximumEntropy <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

* Option for model extracted dir

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Add index mapping dir

Signed-off-by: MaximumEntropy <[email protected]>

* Assistant prompt

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Remove ipdb

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Override dropout

Signed-off-by: MaximumEntropy <[email protected]>

* Change sampler

Signed-off-by: MaximumEntropy <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Roll back again

Signed-off-by: MaximumEntropy <[email protected]>

* Revert TTS

Signed-off-by: MaximumEntropy <[email protected]>

* Reset TTS

Signed-off-by: MaximumEntropy <[email protected]>

* Revert further

Signed-off-by: MaximumEntropy <[email protected]>

* Revert more to main

Signed-off-by: MaximumEntropy <[email protected]>

* Fix Test DS

Signed-off-by: MaximumEntropy <[email protected]>

* Address PR comments

Signed-off-by: MaximumEntropy <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Add the option to provide a prompt template via fstrings

Signed-off-by: MaximumEntropy <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Add CI test

Signed-off-by: MaximumEntropy <[email protected]>

* fix ci test

Signed-off-by: MaximumEntropy <[email protected]>

* Fix CI test

Signed-off-by: MaximumEntropy <[email protected]>

* Minor

Signed-off-by: MaximumEntropy <[email protected]>

* Fix CI

Signed-off-by: MaximumEntropy <[email protected]>

* Fix CI

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Fix CI

Signed-off-by: MaximumEntropy <[email protected]>

* Fix workers issue

Signed-off-by: MaximumEntropy <[email protected]>

* Fix workers

Signed-off-by: MaximumEntropy <[email protected]>

---------

Signed-off-by: MaximumEntropy <[email protected]>
Signed-off-by: Adi Renduchintala <[email protected]>
Signed-off-by: khcs <[email protected]>
Signed-off-by: ericharper <[email protected]>
Co-authored-by: soares-f <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adi Renduchintala <[email protected]>
Co-authored-by: khcs <[email protected]>
Co-authored-by: Oleksii Kuchaiev <[email protected]>
Co-authored-by: ericharper <[email protected]>
hsiehjackson pushed a commit to hsiehjackson/NeMo that referenced this pull request Jun 2, 2023
* Explicitly check for united embeddings

Signed-off-by: MaximumEntropy <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: MaximumEntropy <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: hsiehjackson <[email protected]>
hsiehjackson pushed a commit to hsiehjackson/NeMo that referenced this pull request Jun 2, 2023
* copy from sft_from_gpt

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Changed tokenization and example

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* maybe remove (got from upstream)

* Eval metrics while finetuning

Signed-off-by: MaximumEntropy <[email protected]>

* Add missing args

Signed-off-by: MaximumEntropy <[email protected]>

* Add arg

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Wrap in try except

Signed-off-by: MaximumEntropy <[email protected]>

* Try fix

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Add separate validation and test batch sizes

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Add assert

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Fix checkpoint name

Signed-off-by: MaximumEntropy <[email protected]>

* Explict sampling args

Signed-off-by: MaximumEntropy <[email protected]>

* Update t0 script

Signed-off-by: MaximumEntropy <[email protected]>

* Add niv2 script

Signed-off-by: MaximumEntropy <[email protected]>

* Change workers

Signed-off-by: MaximumEntropy <[email protected]>

* Fix labels

Signed-off-by: MaximumEntropy <[email protected]>

* Ignore download

Signed-off-by: MaximumEntropy <[email protected]>

* Minor fixes

Signed-off-by: MaximumEntropy <[email protected]>

* Add dist opt support

Signed-off-by: MaximumEntropy <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Minor

Signed-off-by: MaximumEntropy <[email protected]>

* Allow skipping validation

Signed-off-by: MaximumEntropy <[email protected]>

* Fix tokenization and padding to max batch

Signed-off-by: MaximumEntropy <[email protected]>

* Adds several configurable flags for Megatron GPT models (NVIDIA#5991)

* Initial

Signed-off-by: MaximumEntropy <[email protected]>

* Multiple fixes

Signed-off-by: MaximumEntropy <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Add to CI test

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* check position embs for gpt prompt learning

Signed-off-by: Adi Renduchintala <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Update args

Signed-off-by: MaximumEntropy <[email protected]>

* Disable tts unit test

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Empty

Signed-off-by: MaximumEntropy <[email protected]>

* Update Jenkinsfile

Changed optimizer for GPT training from 'fused_adam' to 'distributed_fused_adam'.

Signed-off-by: khcs <[email protected]>

* update config to to use correct key

Signed-off-by: ericharper <[email protected]>

* revert Jenkinsfile back to fused_adam

Signed-off-by: ericharper <[email protected]>

---------

Signed-off-by: MaximumEntropy <[email protected]>
Signed-off-by: Adi Renduchintala <[email protected]>
Signed-off-by: khcs <[email protected]>
Signed-off-by: ericharper <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adi Renduchintala <[email protected]>
Co-authored-by: khcs <[email protected]>
Co-authored-by: Oleksii Kuchaiev <[email protected]>
Co-authored-by: ericharper <[email protected]>

* Fast glu activations (NVIDIA#6058)

* fast glu activations

Signed-off-by: MaximumEntropy <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Clean up activation list

Signed-off-by: MaximumEntropy <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: MaximumEntropy <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

* Explicitly check for united embeddings when logging params (NVIDIA#6085)

* Explicitly check for united embeddings

Signed-off-by: MaximumEntropy <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: MaximumEntropy <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

* Option for model extracted dir

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Add index mapping dir

Signed-off-by: MaximumEntropy <[email protected]>

* Assistant prompt

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Remove ipdb

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Override dropout

Signed-off-by: MaximumEntropy <[email protected]>

* Change sampler

Signed-off-by: MaximumEntropy <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Roll back again

Signed-off-by: MaximumEntropy <[email protected]>

* Revert TTS

Signed-off-by: MaximumEntropy <[email protected]>

* Reset TTS

Signed-off-by: MaximumEntropy <[email protected]>

* Revert further

Signed-off-by: MaximumEntropy <[email protected]>

* Revert more to main

Signed-off-by: MaximumEntropy <[email protected]>

* Fix Test DS

Signed-off-by: MaximumEntropy <[email protected]>

* Address PR comments

Signed-off-by: MaximumEntropy <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Add the option to provide a prompt template via fstrings

Signed-off-by: MaximumEntropy <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Add CI test

Signed-off-by: MaximumEntropy <[email protected]>

* fix ci test

Signed-off-by: MaximumEntropy <[email protected]>

* Fix CI test

Signed-off-by: MaximumEntropy <[email protected]>

* Minor

Signed-off-by: MaximumEntropy <[email protected]>

* Fix CI

Signed-off-by: MaximumEntropy <[email protected]>

* Fix CI

Signed-off-by: MaximumEntropy <[email protected]>

* Fix

Signed-off-by: MaximumEntropy <[email protected]>

* Fix CI

Signed-off-by: MaximumEntropy <[email protected]>

* Fix workers issue

Signed-off-by: MaximumEntropy <[email protected]>

* Fix workers

Signed-off-by: MaximumEntropy <[email protected]>

---------

Signed-off-by: MaximumEntropy <[email protected]>
Signed-off-by: Adi Renduchintala <[email protected]>
Signed-off-by: khcs <[email protected]>
Signed-off-by: ericharper <[email protected]>
Co-authored-by: soares-f <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adi Renduchintala <[email protected]>
Co-authored-by: khcs <[email protected]>
Co-authored-by: Oleksii Kuchaiev <[email protected]>
Co-authored-by: ericharper <[email protected]>
Signed-off-by: hsiehjackson <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working NLP
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants