-
Notifications
You must be signed in to change notification settings - Fork 2.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix triton import guards #8552
Fix triton import guards #8552
Conversation
5e82e5e
to
0dff090
Compare
jenkins |
Signed-off-by: Michal Futrega <[email protected]>
0dff090
to
ec9671d
Compare
jenkins |
@michal2409 as far as I know we install |
You're right. I checked the DLFW containers and they do have The other outstanding issue is - what is the expected behavior if |
If triton is not installed, we'll get a |
I don't think this is true. I'm getting:
which is not being handled. Could you please double check @hsiehjackson ? Maybe I'm doing something wrong EDIT: |
When my container install
|
I don't know, it works for me. But sure, if you need |
Sure! @michal2409 Can you help me revert the first line |
Signed-off-by: Michal Futrega <[email protected]>
@hsiehjackson done |
jenkins |
jenkins |
1 similar comment
jenkins |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
jenkins |
jenkins |
* Fix triton import guards Signed-off-by: Michal Futrega <[email protected]> * Update attention.py Signed-off-by: Michal Futrega <[email protected]> --------- Signed-off-by: Michal Futrega <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]>
* Fix triton import guards Signed-off-by: Michal Futrega <[email protected]> * Update attention.py Signed-off-by: Michal Futrega <[email protected]> --------- Signed-off-by: Michal Futrega <[email protected]> Signed-off-by: Zeeshan Patel <[email protected]>
* Fix triton import guards Signed-off-by: Michal Futrega <[email protected]> * Update attention.py Signed-off-by: Michal Futrega <[email protected]> --------- Signed-off-by: Michal Futrega <[email protected]> Signed-off-by: duongvdo <[email protected]>
* Fix triton import guards Signed-off-by: Michal Futrega <[email protected]> * Update attention.py Signed-off-by: Michal Futrega <[email protected]> --------- Signed-off-by: Michal Futrega <[email protected]> Signed-off-by: Agoniii <[email protected]>
* Fix triton import guards Signed-off-by: Michal Futrega <[email protected]> * Update attention.py Signed-off-by: Michal Futrega <[email protected]> --------- Signed-off-by: Michal Futrega <[email protected]> Signed-off-by: ataghibakhsh <[email protected]>
* Fix triton import guards Signed-off-by: Michal Futrega <[email protected]> * Update attention.py Signed-off-by: Michal Futrega <[email protected]> --------- Signed-off-by: Michal Futrega <[email protected]> Signed-off-by: Pablo Garay <[email protected]>
…t-only) dataloading (#8581) * wip Signed-off-by: Piotr Żelasko <[email protected]> * Partially working config groups Signed-off-by: Piotr Żelasko <[email protected]> * Working test with abasic group in the input config Signed-off-by: Piotr Żelasko <[email protected]> * Working test with nested groups in input config Signed-off-by: Piotr Żelasko <[email protected]> * Working test with specifying a YAML path for input_cfg Signed-off-by: Piotr Żelasko <[email protected]> * a very rough example of text dataloading via lhotse Signed-off-by: Piotr Żelasko <[email protected]> * Cleaner integration of multimodal audio/text loading that allows to control the effective audio vs text size (requires latest lhotse) Signed-off-by: Piotr Żelasko <[email protected]> * remove obsolete test Signed-off-by: Piotr Żelasko <[email protected]> * Fix an import in export_utils.py (#8571) Signed-off-by: w4-jinhyeonkim <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * Yttm deprecation (#8322) * yttm deprecation init commit Signed-off-by: AlexGrinch <[email protected]> * removed tests Signed-off-by: AlexGrinch <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * bug fix Signed-off-by: AlexGrinch <[email protected]> * path fix Signed-off-by: AlexGrinch <[email protected]> * fixing path Signed-off-by: AlexGrinch <[email protected]> * updated tests to spm Signed-off-by: AlexGrinch <[email protected]> * updated Jenkinsfile Signed-off-by: AlexGrinch <[email protected]> * new model with spm in tests Signed-off-by: AlexGrinch <[email protected]> * yttm removed Signed-off-by: AlexGrinch <[email protected]> * updated aayn config Signed-off-by: AlexGrinch <[email protected]> --------- Signed-off-by: AlexGrinch <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Piotr Żelasko <[email protected]> * Fixed missing copy import in rnnt_decoder.py (#8580) * Added copy import to rnnt_decoding.py Signed-off-by: Isaac McFadyen <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: Isaac McFadyen <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Piotr Żelasko <[email protected]> * Fix bug in RNNT Joint WER calculation for fused batch (#8587) Signed-off-by: smajumdar <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * Fixed Context Parallel HtoD sync (#8557) * Fixed cp HtoD sync Signed-off-by: Selvaraj Anandaraj <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: Selvaraj Anandaraj <[email protected]> Co-authored-by: Selvaraj Anandaraj <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Piotr Żelasko <[email protected]> * change default and add key to config files (#8594) Signed-off-by: Chen Cui <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * Fix triton import guards (#8552) * Fix triton import guards Signed-off-by: Michal Futrega <[email protected]> * Update attention.py Signed-off-by: Michal Futrega <[email protected]> --------- Signed-off-by: Michal Futrega <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * Add config key for dropout position in LoRA adapter (#8583) Signed-off-by: Michal Futrega <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * fix ia3 mlp infused adapter (#8597) Signed-off-by: Chen Cui <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * Prevent Redundant Gather for LoRA Sequence Parallel (#8602) * enable layernorm output gathered Signed-off-by: Chen Cui <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: Chen Cui <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Piotr Żelasko <[email protected]> * Accelerate `transcribe_speech.py` for short-form data: pre-sorting support (#8564) * POC using bucketing in transcribe_speech.py Signed-off-by: Piotr Żelasko <[email protected]> * extend to multi task aed Signed-off-by: Piotr Żelasko <[email protected]> * fixes for aed multi task text/lang field selectors Signed-off-by: Piotr Żelasko <[email protected]> * remove assert Signed-off-by: Piotr Żelasko <[email protected]> * fix Signed-off-by: Piotr Żelasko <[email protected]> * expose option for bucket buffer size Signed-off-by: Piotr Żelasko <[email protected]> * fixes, ctc support Signed-off-by: Piotr Żelasko <[email protected]> * support pre-sorting manifests in transcribe_speech.py Signed-off-by: Piotr Żelasko <[email protected]> * cleanup Signed-off-by: Piotr Żelasko <[email protected]> * reorder transcriptions back to original manifest order Signed-off-by: Piotr Żelasko <[email protected]> * remove bucketing entirely Signed-off-by: Piotr Żelasko <[email protected]> * code review changes Signed-off-by: Piotr Żelasko <[email protected]> * code review changes--amend Signed-off-by: Piotr Żelasko <[email protected]> * refactor text_field/lang_field passing Signed-off-by: Piotr Żelasko <[email protected]> * Fix reordering bug; disable presorting for multi task for now Signed-off-by: Piotr Żelasko <[email protected]> * Add support for presort + multi task model Signed-off-by: Piotr Żelasko <[email protected]> * Code reviews Signed-off-by: Piotr Żelasko <[email protected]> * Fix jenkins tests, add user-friendly error msg for canary Signed-off-by: Piotr Żelasko <[email protected]> --------- Signed-off-by: Piotr Żelasko <[email protected]> * fix tests Signed-off-by: Piotr Żelasko <[email protected]> * Bump min required lhotse version Signed-off-by: Piotr Żelasko <[email protected]> * Add some documentation about this config format and the multimodal features Signed-off-by: Piotr Żelasko <[email protected]> * Add caution about multiple shards Signed-off-by: Piotr Żelasko <[email protected]> * Address Tom's code review Signed-off-by: Piotr Żelasko <[email protected]> * Add copyright header Signed-off-by: Piotr Żelasko <[email protected]> * Fix (hopefully) issue with forced ascii encoding in CI Signed-off-by: Piotr Żelasko <[email protected]> * Support resolving input_cfg path into config contents Signed-off-by: Piotr Żelasko <[email protected]> * Code review changes in docs Signed-off-by: Piotr Żelasko <[email protected]> * Fix unicode decode error Signed-off-by: Piotr Żelasko <[email protected]> --------- Signed-off-by: Piotr Żelasko <[email protected]> Signed-off-by: w4-jinhyeonkim <[email protected]> Signed-off-by: AlexGrinch <[email protected]> Signed-off-by: Isaac McFadyen <[email protected]> Signed-off-by: smajumdar <[email protected]> Signed-off-by: Selvaraj Anandaraj <[email protected]> Signed-off-by: Chen Cui <[email protected]> Signed-off-by: Michal Futrega <[email protected]> Co-authored-by: w4-jinhyeonkim <[email protected]> Co-authored-by: Aleksey Grinchuk (Oleksii Hrinchuk) <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Isaac McFadyen <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Co-authored-by: Selvaraj Anandaraj <[email protected]> Co-authored-by: Selvaraj Anandaraj <[email protected]> Co-authored-by: Chen Cui <[email protected]> Co-authored-by: Michal Futrega <[email protected]> Co-authored-by: Pablo Garay <[email protected]>
…t-only) dataloading (NVIDIA#8581) * wip Signed-off-by: Piotr Żelasko <[email protected]> * Partially working config groups Signed-off-by: Piotr Żelasko <[email protected]> * Working test with abasic group in the input config Signed-off-by: Piotr Żelasko <[email protected]> * Working test with nested groups in input config Signed-off-by: Piotr Żelasko <[email protected]> * Working test with specifying a YAML path for input_cfg Signed-off-by: Piotr Żelasko <[email protected]> * a very rough example of text dataloading via lhotse Signed-off-by: Piotr Żelasko <[email protected]> * Cleaner integration of multimodal audio/text loading that allows to control the effective audio vs text size (requires latest lhotse) Signed-off-by: Piotr Żelasko <[email protected]> * remove obsolete test Signed-off-by: Piotr Żelasko <[email protected]> * Fix an import in export_utils.py (NVIDIA#8571) Signed-off-by: w4-jinhyeonkim <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * Yttm deprecation (NVIDIA#8322) * yttm deprecation init commit Signed-off-by: AlexGrinch <[email protected]> * removed tests Signed-off-by: AlexGrinch <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * bug fix Signed-off-by: AlexGrinch <[email protected]> * path fix Signed-off-by: AlexGrinch <[email protected]> * fixing path Signed-off-by: AlexGrinch <[email protected]> * updated tests to spm Signed-off-by: AlexGrinch <[email protected]> * updated Jenkinsfile Signed-off-by: AlexGrinch <[email protected]> * new model with spm in tests Signed-off-by: AlexGrinch <[email protected]> * yttm removed Signed-off-by: AlexGrinch <[email protected]> * updated aayn config Signed-off-by: AlexGrinch <[email protected]> --------- Signed-off-by: AlexGrinch <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Piotr Żelasko <[email protected]> * Fixed missing copy import in rnnt_decoder.py (NVIDIA#8580) * Added copy import to rnnt_decoding.py Signed-off-by: Isaac McFadyen <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: Isaac McFadyen <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Piotr Żelasko <[email protected]> * Fix bug in RNNT Joint WER calculation for fused batch (NVIDIA#8587) Signed-off-by: smajumdar <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * Fixed Context Parallel HtoD sync (NVIDIA#8557) * Fixed cp HtoD sync Signed-off-by: Selvaraj Anandaraj <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: Selvaraj Anandaraj <[email protected]> Co-authored-by: Selvaraj Anandaraj <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Piotr Żelasko <[email protected]> * change default and add key to config files (NVIDIA#8594) Signed-off-by: Chen Cui <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * Fix triton import guards (NVIDIA#8552) * Fix triton import guards Signed-off-by: Michal Futrega <[email protected]> * Update attention.py Signed-off-by: Michal Futrega <[email protected]> --------- Signed-off-by: Michal Futrega <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * Add config key for dropout position in LoRA adapter (NVIDIA#8583) Signed-off-by: Michal Futrega <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * fix ia3 mlp infused adapter (NVIDIA#8597) Signed-off-by: Chen Cui <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * Prevent Redundant Gather for LoRA Sequence Parallel (NVIDIA#8602) * enable layernorm output gathered Signed-off-by: Chen Cui <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: Chen Cui <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Piotr Żelasko <[email protected]> * Accelerate `transcribe_speech.py` for short-form data: pre-sorting support (NVIDIA#8564) * POC using bucketing in transcribe_speech.py Signed-off-by: Piotr Żelasko <[email protected]> * extend to multi task aed Signed-off-by: Piotr Żelasko <[email protected]> * fixes for aed multi task text/lang field selectors Signed-off-by: Piotr Żelasko <[email protected]> * remove assert Signed-off-by: Piotr Żelasko <[email protected]> * fix Signed-off-by: Piotr Żelasko <[email protected]> * expose option for bucket buffer size Signed-off-by: Piotr Żelasko <[email protected]> * fixes, ctc support Signed-off-by: Piotr Żelasko <[email protected]> * support pre-sorting manifests in transcribe_speech.py Signed-off-by: Piotr Żelasko <[email protected]> * cleanup Signed-off-by: Piotr Żelasko <[email protected]> * reorder transcriptions back to original manifest order Signed-off-by: Piotr Żelasko <[email protected]> * remove bucketing entirely Signed-off-by: Piotr Żelasko <[email protected]> * code review changes Signed-off-by: Piotr Żelasko <[email protected]> * code review changes--amend Signed-off-by: Piotr Żelasko <[email protected]> * refactor text_field/lang_field passing Signed-off-by: Piotr Żelasko <[email protected]> * Fix reordering bug; disable presorting for multi task for now Signed-off-by: Piotr Żelasko <[email protected]> * Add support for presort + multi task model Signed-off-by: Piotr Żelasko <[email protected]> * Code reviews Signed-off-by: Piotr Żelasko <[email protected]> * Fix jenkins tests, add user-friendly error msg for canary Signed-off-by: Piotr Żelasko <[email protected]> --------- Signed-off-by: Piotr Żelasko <[email protected]> * fix tests Signed-off-by: Piotr Żelasko <[email protected]> * Bump min required lhotse version Signed-off-by: Piotr Żelasko <[email protected]> * Add some documentation about this config format and the multimodal features Signed-off-by: Piotr Żelasko <[email protected]> * Add caution about multiple shards Signed-off-by: Piotr Żelasko <[email protected]> * Address Tom's code review Signed-off-by: Piotr Żelasko <[email protected]> * Add copyright header Signed-off-by: Piotr Żelasko <[email protected]> * Fix (hopefully) issue with forced ascii encoding in CI Signed-off-by: Piotr Żelasko <[email protected]> * Support resolving input_cfg path into config contents Signed-off-by: Piotr Żelasko <[email protected]> * Code review changes in docs Signed-off-by: Piotr Żelasko <[email protected]> * Fix unicode decode error Signed-off-by: Piotr Żelasko <[email protected]> --------- Signed-off-by: Piotr Żelasko <[email protected]> Signed-off-by: w4-jinhyeonkim <[email protected]> Signed-off-by: AlexGrinch <[email protected]> Signed-off-by: Isaac McFadyen <[email protected]> Signed-off-by: smajumdar <[email protected]> Signed-off-by: Selvaraj Anandaraj <[email protected]> Signed-off-by: Chen Cui <[email protected]> Signed-off-by: Michal Futrega <[email protected]> Co-authored-by: w4-jinhyeonkim <[email protected]> Co-authored-by: Aleksey Grinchuk (Oleksii Hrinchuk) <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Isaac McFadyen <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Co-authored-by: Selvaraj Anandaraj <[email protected]> Co-authored-by: Selvaraj Anandaraj <[email protected]> Co-authored-by: Chen Cui <[email protected]> Co-authored-by: Michal Futrega <[email protected]> Co-authored-by: Pablo Garay <[email protected]>
…t-only) dataloading (NVIDIA#8581) * wip Signed-off-by: Piotr Żelasko <[email protected]> * Partially working config groups Signed-off-by: Piotr Żelasko <[email protected]> * Working test with abasic group in the input config Signed-off-by: Piotr Żelasko <[email protected]> * Working test with nested groups in input config Signed-off-by: Piotr Żelasko <[email protected]> * Working test with specifying a YAML path for input_cfg Signed-off-by: Piotr Żelasko <[email protected]> * a very rough example of text dataloading via lhotse Signed-off-by: Piotr Żelasko <[email protected]> * Cleaner integration of multimodal audio/text loading that allows to control the effective audio vs text size (requires latest lhotse) Signed-off-by: Piotr Żelasko <[email protected]> * remove obsolete test Signed-off-by: Piotr Żelasko <[email protected]> * Fix an import in export_utils.py (NVIDIA#8571) Signed-off-by: w4-jinhyeonkim <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * Yttm deprecation (NVIDIA#8322) * yttm deprecation init commit Signed-off-by: AlexGrinch <[email protected]> * removed tests Signed-off-by: AlexGrinch <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * bug fix Signed-off-by: AlexGrinch <[email protected]> * path fix Signed-off-by: AlexGrinch <[email protected]> * fixing path Signed-off-by: AlexGrinch <[email protected]> * updated tests to spm Signed-off-by: AlexGrinch <[email protected]> * updated Jenkinsfile Signed-off-by: AlexGrinch <[email protected]> * new model with spm in tests Signed-off-by: AlexGrinch <[email protected]> * yttm removed Signed-off-by: AlexGrinch <[email protected]> * updated aayn config Signed-off-by: AlexGrinch <[email protected]> --------- Signed-off-by: AlexGrinch <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Piotr Żelasko <[email protected]> * Fixed missing copy import in rnnt_decoder.py (NVIDIA#8580) * Added copy import to rnnt_decoding.py Signed-off-by: Isaac McFadyen <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: Isaac McFadyen <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Piotr Żelasko <[email protected]> * Fix bug in RNNT Joint WER calculation for fused batch (NVIDIA#8587) Signed-off-by: smajumdar <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * Fixed Context Parallel HtoD sync (NVIDIA#8557) * Fixed cp HtoD sync Signed-off-by: Selvaraj Anandaraj <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: Selvaraj Anandaraj <[email protected]> Co-authored-by: Selvaraj Anandaraj <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Piotr Żelasko <[email protected]> * change default and add key to config files (NVIDIA#8594) Signed-off-by: Chen Cui <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * Fix triton import guards (NVIDIA#8552) * Fix triton import guards Signed-off-by: Michal Futrega <[email protected]> * Update attention.py Signed-off-by: Michal Futrega <[email protected]> --------- Signed-off-by: Michal Futrega <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * Add config key for dropout position in LoRA adapter (NVIDIA#8583) Signed-off-by: Michal Futrega <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * fix ia3 mlp infused adapter (NVIDIA#8597) Signed-off-by: Chen Cui <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * Prevent Redundant Gather for LoRA Sequence Parallel (NVIDIA#8602) * enable layernorm output gathered Signed-off-by: Chen Cui <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: Chen Cui <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Piotr Żelasko <[email protected]> * Accelerate `transcribe_speech.py` for short-form data: pre-sorting support (NVIDIA#8564) * POC using bucketing in transcribe_speech.py Signed-off-by: Piotr Żelasko <[email protected]> * extend to multi task aed Signed-off-by: Piotr Żelasko <[email protected]> * fixes for aed multi task text/lang field selectors Signed-off-by: Piotr Żelasko <[email protected]> * remove assert Signed-off-by: Piotr Żelasko <[email protected]> * fix Signed-off-by: Piotr Żelasko <[email protected]> * expose option for bucket buffer size Signed-off-by: Piotr Żelasko <[email protected]> * fixes, ctc support Signed-off-by: Piotr Żelasko <[email protected]> * support pre-sorting manifests in transcribe_speech.py Signed-off-by: Piotr Żelasko <[email protected]> * cleanup Signed-off-by: Piotr Żelasko <[email protected]> * reorder transcriptions back to original manifest order Signed-off-by: Piotr Żelasko <[email protected]> * remove bucketing entirely Signed-off-by: Piotr Żelasko <[email protected]> * code review changes Signed-off-by: Piotr Żelasko <[email protected]> * code review changes--amend Signed-off-by: Piotr Żelasko <[email protected]> * refactor text_field/lang_field passing Signed-off-by: Piotr Żelasko <[email protected]> * Fix reordering bug; disable presorting for multi task for now Signed-off-by: Piotr Żelasko <[email protected]> * Add support for presort + multi task model Signed-off-by: Piotr Żelasko <[email protected]> * Code reviews Signed-off-by: Piotr Żelasko <[email protected]> * Fix jenkins tests, add user-friendly error msg for canary Signed-off-by: Piotr Żelasko <[email protected]> --------- Signed-off-by: Piotr Żelasko <[email protected]> * fix tests Signed-off-by: Piotr Żelasko <[email protected]> * Bump min required lhotse version Signed-off-by: Piotr Żelasko <[email protected]> * Add some documentation about this config format and the multimodal features Signed-off-by: Piotr Żelasko <[email protected]> * Add caution about multiple shards Signed-off-by: Piotr Żelasko <[email protected]> * Address Tom's code review Signed-off-by: Piotr Żelasko <[email protected]> * Add copyright header Signed-off-by: Piotr Żelasko <[email protected]> * Fix (hopefully) issue with forced ascii encoding in CI Signed-off-by: Piotr Żelasko <[email protected]> * Support resolving input_cfg path into config contents Signed-off-by: Piotr Żelasko <[email protected]> * Code review changes in docs Signed-off-by: Piotr Żelasko <[email protected]> * Fix unicode decode error Signed-off-by: Piotr Żelasko <[email protected]> --------- Signed-off-by: Piotr Żelasko <[email protected]> Signed-off-by: w4-jinhyeonkim <[email protected]> Signed-off-by: AlexGrinch <[email protected]> Signed-off-by: Isaac McFadyen <[email protected]> Signed-off-by: smajumdar <[email protected]> Signed-off-by: Selvaraj Anandaraj <[email protected]> Signed-off-by: Chen Cui <[email protected]> Signed-off-by: Michal Futrega <[email protected]> Co-authored-by: w4-jinhyeonkim <[email protected]> Co-authored-by: Aleksey Grinchuk (Oleksii Hrinchuk) <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Isaac McFadyen <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Co-authored-by: Selvaraj Anandaraj <[email protected]> Co-authored-by: Selvaraj Anandaraj <[email protected]> Co-authored-by: Chen Cui <[email protected]> Co-authored-by: Michal Futrega <[email protected]> Co-authored-by: Pablo Garay <[email protected]>
…t-only) dataloading (#8581) * wip Signed-off-by: Piotr Żelasko <[email protected]> * Partially working config groups Signed-off-by: Piotr Żelasko <[email protected]> * Working test with abasic group in the input config Signed-off-by: Piotr Żelasko <[email protected]> * Working test with nested groups in input config Signed-off-by: Piotr Żelasko <[email protected]> * Working test with specifying a YAML path for input_cfg Signed-off-by: Piotr Żelasko <[email protected]> * a very rough example of text dataloading via lhotse Signed-off-by: Piotr Żelasko <[email protected]> * Cleaner integration of multimodal audio/text loading that allows to control the effective audio vs text size (requires latest lhotse) Signed-off-by: Piotr Żelasko <[email protected]> * remove obsolete test Signed-off-by: Piotr Żelasko <[email protected]> * Fix an import in export_utils.py (#8571) Signed-off-by: w4-jinhyeonkim <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * Yttm deprecation (#8322) * yttm deprecation init commit Signed-off-by: AlexGrinch <[email protected]> * removed tests Signed-off-by: AlexGrinch <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * bug fix Signed-off-by: AlexGrinch <[email protected]> * path fix Signed-off-by: AlexGrinch <[email protected]> * fixing path Signed-off-by: AlexGrinch <[email protected]> * updated tests to spm Signed-off-by: AlexGrinch <[email protected]> * updated Jenkinsfile Signed-off-by: AlexGrinch <[email protected]> * new model with spm in tests Signed-off-by: AlexGrinch <[email protected]> * yttm removed Signed-off-by: AlexGrinch <[email protected]> * updated aayn config Signed-off-by: AlexGrinch <[email protected]> --------- Signed-off-by: AlexGrinch <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Piotr Żelasko <[email protected]> * Fixed missing copy import in rnnt_decoder.py (#8580) * Added copy import to rnnt_decoding.py Signed-off-by: Isaac McFadyen <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: Isaac McFadyen <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Piotr Żelasko <[email protected]> * Fix bug in RNNT Joint WER calculation for fused batch (#8587) Signed-off-by: smajumdar <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * Fixed Context Parallel HtoD sync (#8557) * Fixed cp HtoD sync Signed-off-by: Selvaraj Anandaraj <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: Selvaraj Anandaraj <[email protected]> Co-authored-by: Selvaraj Anandaraj <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Piotr Żelasko <[email protected]> * change default and add key to config files (#8594) Signed-off-by: Chen Cui <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * Fix triton import guards (#8552) * Fix triton import guards Signed-off-by: Michal Futrega <[email protected]> * Update attention.py Signed-off-by: Michal Futrega <[email protected]> --------- Signed-off-by: Michal Futrega <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * Add config key for dropout position in LoRA adapter (#8583) Signed-off-by: Michal Futrega <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * fix ia3 mlp infused adapter (#8597) Signed-off-by: Chen Cui <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * Prevent Redundant Gather for LoRA Sequence Parallel (#8602) * enable layernorm output gathered Signed-off-by: Chen Cui <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: Chen Cui <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Piotr Żelasko <[email protected]> * Accelerate `transcribe_speech.py` for short-form data: pre-sorting support (#8564) * POC using bucketing in transcribe_speech.py Signed-off-by: Piotr Żelasko <[email protected]> * extend to multi task aed Signed-off-by: Piotr Żelasko <[email protected]> * fixes for aed multi task text/lang field selectors Signed-off-by: Piotr Żelasko <[email protected]> * remove assert Signed-off-by: Piotr Żelasko <[email protected]> * fix Signed-off-by: Piotr Żelasko <[email protected]> * expose option for bucket buffer size Signed-off-by: Piotr Żelasko <[email protected]> * fixes, ctc support Signed-off-by: Piotr Żelasko <[email protected]> * support pre-sorting manifests in transcribe_speech.py Signed-off-by: Piotr Żelasko <[email protected]> * cleanup Signed-off-by: Piotr Żelasko <[email protected]> * reorder transcriptions back to original manifest order Signed-off-by: Piotr Żelasko <[email protected]> * remove bucketing entirely Signed-off-by: Piotr Żelasko <[email protected]> * code review changes Signed-off-by: Piotr Żelasko <[email protected]> * code review changes--amend Signed-off-by: Piotr Żelasko <[email protected]> * refactor text_field/lang_field passing Signed-off-by: Piotr Żelasko <[email protected]> * Fix reordering bug; disable presorting for multi task for now Signed-off-by: Piotr Żelasko <[email protected]> * Add support for presort + multi task model Signed-off-by: Piotr Żelasko <[email protected]> * Code reviews Signed-off-by: Piotr Żelasko <[email protected]> * Fix jenkins tests, add user-friendly error msg for canary Signed-off-by: Piotr Żelasko <[email protected]> --------- Signed-off-by: Piotr Żelasko <[email protected]> * fix tests Signed-off-by: Piotr Żelasko <[email protected]> * Bump min required lhotse version Signed-off-by: Piotr Żelasko <[email protected]> * Add some documentation about this config format and the multimodal features Signed-off-by: Piotr Żelasko <[email protected]> * Add caution about multiple shards Signed-off-by: Piotr Żelasko <[email protected]> * Address Tom's code review Signed-off-by: Piotr Żelasko <[email protected]> * Add copyright header Signed-off-by: Piotr Żelasko <[email protected]> * Fix (hopefully) issue with forced ascii encoding in CI Signed-off-by: Piotr Żelasko <[email protected]> * Support resolving input_cfg path into config contents Signed-off-by: Piotr Żelasko <[email protected]> * Code review changes in docs Signed-off-by: Piotr Żelasko <[email protected]> * Fix unicode decode error Signed-off-by: Piotr Żelasko <[email protected]> --------- Signed-off-by: Piotr Żelasko <[email protected]> Signed-off-by: w4-jinhyeonkim <[email protected]> Signed-off-by: AlexGrinch <[email protected]> Signed-off-by: Isaac McFadyen <[email protected]> Signed-off-by: smajumdar <[email protected]> Signed-off-by: Selvaraj Anandaraj <[email protected]> Signed-off-by: Chen Cui <[email protected]> Signed-off-by: Michal Futrega <[email protected]> Co-authored-by: w4-jinhyeonkim <[email protected]> Co-authored-by: Aleksey Grinchuk (Oleksii Hrinchuk) <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Isaac McFadyen <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Co-authored-by: Selvaraj Anandaraj <[email protected]> Co-authored-by: Selvaraj Anandaraj <[email protected]> Co-authored-by: Chen Cui <[email protected]> Co-authored-by: Michal Futrega <[email protected]> Co-authored-by: Pablo Garay <[email protected]> Signed-off-by: Ao Tang <[email protected]>
* Fix triton import guards Signed-off-by: Michal Futrega <[email protected]> * Update attention.py Signed-off-by: Michal Futrega <[email protected]> --------- Signed-off-by: Michal Futrega <[email protected]>
…t-only) dataloading (NVIDIA#8581) * wip Signed-off-by: Piotr Żelasko <[email protected]> * Partially working config groups Signed-off-by: Piotr Żelasko <[email protected]> * Working test with abasic group in the input config Signed-off-by: Piotr Żelasko <[email protected]> * Working test with nested groups in input config Signed-off-by: Piotr Żelasko <[email protected]> * Working test with specifying a YAML path for input_cfg Signed-off-by: Piotr Żelasko <[email protected]> * a very rough example of text dataloading via lhotse Signed-off-by: Piotr Żelasko <[email protected]> * Cleaner integration of multimodal audio/text loading that allows to control the effective audio vs text size (requires latest lhotse) Signed-off-by: Piotr Żelasko <[email protected]> * remove obsolete test Signed-off-by: Piotr Żelasko <[email protected]> * Fix an import in export_utils.py (NVIDIA#8571) Signed-off-by: w4-jinhyeonkim <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * Yttm deprecation (NVIDIA#8322) * yttm deprecation init commit Signed-off-by: AlexGrinch <[email protected]> * removed tests Signed-off-by: AlexGrinch <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * bug fix Signed-off-by: AlexGrinch <[email protected]> * path fix Signed-off-by: AlexGrinch <[email protected]> * fixing path Signed-off-by: AlexGrinch <[email protected]> * updated tests to spm Signed-off-by: AlexGrinch <[email protected]> * updated Jenkinsfile Signed-off-by: AlexGrinch <[email protected]> * new model with spm in tests Signed-off-by: AlexGrinch <[email protected]> * yttm removed Signed-off-by: AlexGrinch <[email protected]> * updated aayn config Signed-off-by: AlexGrinch <[email protected]> --------- Signed-off-by: AlexGrinch <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Piotr Żelasko <[email protected]> * Fixed missing copy import in rnnt_decoder.py (NVIDIA#8580) * Added copy import to rnnt_decoding.py Signed-off-by: Isaac McFadyen <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: Isaac McFadyen <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Piotr Żelasko <[email protected]> * Fix bug in RNNT Joint WER calculation for fused batch (NVIDIA#8587) Signed-off-by: smajumdar <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * Fixed Context Parallel HtoD sync (NVIDIA#8557) * Fixed cp HtoD sync Signed-off-by: Selvaraj Anandaraj <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: Selvaraj Anandaraj <[email protected]> Co-authored-by: Selvaraj Anandaraj <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Piotr Żelasko <[email protected]> * change default and add key to config files (NVIDIA#8594) Signed-off-by: Chen Cui <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * Fix triton import guards (NVIDIA#8552) * Fix triton import guards Signed-off-by: Michal Futrega <[email protected]> * Update attention.py Signed-off-by: Michal Futrega <[email protected]> --------- Signed-off-by: Michal Futrega <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * Add config key for dropout position in LoRA adapter (NVIDIA#8583) Signed-off-by: Michal Futrega <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * fix ia3 mlp infused adapter (NVIDIA#8597) Signed-off-by: Chen Cui <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]> * Prevent Redundant Gather for LoRA Sequence Parallel (NVIDIA#8602) * enable layernorm output gathered Signed-off-by: Chen Cui <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: Chen Cui <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Piotr Żelasko <[email protected]> * Accelerate `transcribe_speech.py` for short-form data: pre-sorting support (NVIDIA#8564) * POC using bucketing in transcribe_speech.py Signed-off-by: Piotr Żelasko <[email protected]> * extend to multi task aed Signed-off-by: Piotr Żelasko <[email protected]> * fixes for aed multi task text/lang field selectors Signed-off-by: Piotr Żelasko <[email protected]> * remove assert Signed-off-by: Piotr Żelasko <[email protected]> * fix Signed-off-by: Piotr Żelasko <[email protected]> * expose option for bucket buffer size Signed-off-by: Piotr Żelasko <[email protected]> * fixes, ctc support Signed-off-by: Piotr Żelasko <[email protected]> * support pre-sorting manifests in transcribe_speech.py Signed-off-by: Piotr Żelasko <[email protected]> * cleanup Signed-off-by: Piotr Żelasko <[email protected]> * reorder transcriptions back to original manifest order Signed-off-by: Piotr Żelasko <[email protected]> * remove bucketing entirely Signed-off-by: Piotr Żelasko <[email protected]> * code review changes Signed-off-by: Piotr Żelasko <[email protected]> * code review changes--amend Signed-off-by: Piotr Żelasko <[email protected]> * refactor text_field/lang_field passing Signed-off-by: Piotr Żelasko <[email protected]> * Fix reordering bug; disable presorting for multi task for now Signed-off-by: Piotr Żelasko <[email protected]> * Add support for presort + multi task model Signed-off-by: Piotr Żelasko <[email protected]> * Code reviews Signed-off-by: Piotr Żelasko <[email protected]> * Fix jenkins tests, add user-friendly error msg for canary Signed-off-by: Piotr Żelasko <[email protected]> --------- Signed-off-by: Piotr Żelasko <[email protected]> * fix tests Signed-off-by: Piotr Żelasko <[email protected]> * Bump min required lhotse version Signed-off-by: Piotr Żelasko <[email protected]> * Add some documentation about this config format and the multimodal features Signed-off-by: Piotr Żelasko <[email protected]> * Add caution about multiple shards Signed-off-by: Piotr Żelasko <[email protected]> * Address Tom's code review Signed-off-by: Piotr Żelasko <[email protected]> * Add copyright header Signed-off-by: Piotr Żelasko <[email protected]> * Fix (hopefully) issue with forced ascii encoding in CI Signed-off-by: Piotr Żelasko <[email protected]> * Support resolving input_cfg path into config contents Signed-off-by: Piotr Żelasko <[email protected]> * Code review changes in docs Signed-off-by: Piotr Żelasko <[email protected]> * Fix unicode decode error Signed-off-by: Piotr Żelasko <[email protected]> --------- Signed-off-by: Piotr Żelasko <[email protected]> Signed-off-by: w4-jinhyeonkim <[email protected]> Signed-off-by: AlexGrinch <[email protected]> Signed-off-by: Isaac McFadyen <[email protected]> Signed-off-by: smajumdar <[email protected]> Signed-off-by: Selvaraj Anandaraj <[email protected]> Signed-off-by: Chen Cui <[email protected]> Signed-off-by: Michal Futrega <[email protected]> Co-authored-by: w4-jinhyeonkim <[email protected]> Co-authored-by: Aleksey Grinchuk (Oleksii Hrinchuk) <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Isaac McFadyen <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Co-authored-by: Selvaraj Anandaraj <[email protected]> Co-authored-by: Selvaraj Anandaraj <[email protected]> Co-authored-by: Chen Cui <[email protected]> Co-authored-by: Michal Futrega <[email protected]> Co-authored-by: Pablo Garay <[email protected]>
What does this PR do ?
As far as pkg_resources is concerned, there's no such thing as triton. Instead, it recognizes pytorch-triton. Here's the fix. Also, if the pytorch-triton lib is missing, it raises pkg_resources.DistributionNotFound which currently is not being handled properly.
Collection: nlp
Changelog
Usage
# Add a code snippet demonstrating how to use this
Jenkins CI
To run Jenkins, a NeMo User with write access must comment
jenkins
on the PR.Before your PR is "Ready for review"
Pre checks:
PR Type:
If you haven't finished some of the above items you can still open "Draft" PR.
Who can review?
Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.
Additional Information