bert_module: fix inputs of export model #3815

virajkarandikar · 2022-03-09T11:27:57Z

Exported ONNX model had to be passed with "attention_mask" and
"token_type_ids" inputs swapped to get correct output.

Changing the input order fixes this issue.
Also return a dict instead of list for correctly passing inputs.

What does this PR do ?

Fix swapped model inputs for exported ONNX/TRT model.

Collection: NLP

Changelog

Fix order of inputs returned via input_types()

Usage

You can potentially add a usage example below

# Add a code snippet demonstrating how to use this

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests? - Updated related tests.
Did you add or update any necessary documentation? - Not required
Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc) - No
- Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

Bugfix

If you haven't finished some of the above items you can still open "Draft" PR.

Who can review?

@MaximumEntropy, @ericharper, @ekmb, @yzhang123

Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.

Additional Information

Related to # (issue)

Exported ONNX model had to be passed with "attention_mask" and "token_type_ids" inputs swapped to get correct output. Changing the input order fixes this issue. Also return a dict instead of list for correctly passing inputs. Update relevant tests: test_TokenClassificationModel_export_to_onnx test_PunctuationCapitalizationModel_export_to_onnx test_QAModel_export_to_onnx Signed-off-by: Viraj Karandikar <[email protected]>

ryanleary · 2022-03-10T15:12:36Z

Changes look OK to me.

yzhang123 · 2022-03-11T16:50:19Z

@borisfom please double check before we merge

Exported ONNX model had to be passed with "attention_mask" and "token_type_ids" inputs swapped to get correct output. Changing the input order fixes this issue. Also return a dict instead of list for correctly passing inputs. Update relevant tests: test_TokenClassificationModel_export_to_onnx test_PunctuationCapitalizationModel_export_to_onnx test_QAModel_export_to_onnx Signed-off-by: Viraj Karandikar <[email protected]> Co-authored-by: Yang Zhang <[email protected]>

…nto account (#3826) * initial fix: taking data parallel size into consideration Signed-off-by: Masaki Kozuki <[email protected]> * update the signature Signed-off-by: Masaki Kozuki <[email protected]> * data_parallel_rank -> data_parallel_size Signed-off-by: Masaki Kozuki <[email protected]> * fix typo Signed-off-by: Masaki Kozuki <[email protected]> * cosmetic changes Signed-off-by: Masaki Kozuki <[email protected]> * Revert "update the signature" This reverts commit 1c134e5. Signed-off-by: Masaki Kozuki <[email protected]> * update batch_sampler arguments Signed-off-by: Masaki Kozuki <[email protected]> * change how to slice `batch` Signed-off-by: Masaki Kozuki <[email protected]> * update not drop_last path Signed-off-by: Masaki Kozuki <[email protected]> * add fr asr ckpt to doc (#3809) Signed-off-by: Yang Zhang <[email protected]> * bert_module: fix inputs of exported model (#3815) Exported ONNX model had to be passed with "attention_mask" and "token_type_ids" inputs swapped to get correct output. Changing the input order fixes this issue. Also return a dict instead of list for correctly passing inputs. Update relevant tests: test_TokenClassificationModel_export_to_onnx test_PunctuationCapitalizationModel_export_to_onnx test_QAModel_export_to_onnx Signed-off-by: Viraj Karandikar <[email protected]> Co-authored-by: Yang Zhang <[email protected]> * update random batch sampler Signed-off-by: Masaki Kozuki <[email protected]> Co-authored-by: Yang Zhang <[email protected]> Co-authored-by: Viraj Karandikar <[email protected]> Co-authored-by: Eric Harper <[email protected]>

Exported ONNX model had to be passed with "attention_mask" and "token_type_ids" inputs swapped to get correct output. Changing the input order fixes this issue. Also return a dict instead of list for correctly passing inputs. Update relevant tests: test_TokenClassificationModel_export_to_onnx test_PunctuationCapitalizationModel_export_to_onnx test_QAModel_export_to_onnx Signed-off-by: Viraj Karandikar <[email protected]> Co-authored-by: Yang Zhang <[email protected]>

…nto account (#3826) * initial fix: taking data parallel size into consideration Signed-off-by: Masaki Kozuki <[email protected]> * update the signature Signed-off-by: Masaki Kozuki <[email protected]> * data_parallel_rank -> data_parallel_size Signed-off-by: Masaki Kozuki <[email protected]> * fix typo Signed-off-by: Masaki Kozuki <[email protected]> * cosmetic changes Signed-off-by: Masaki Kozuki <[email protected]> * Revert "update the signature" This reverts commit 1c134e5. Signed-off-by: Masaki Kozuki <[email protected]> * update batch_sampler arguments Signed-off-by: Masaki Kozuki <[email protected]> * change how to slice `batch` Signed-off-by: Masaki Kozuki <[email protected]> * update not drop_last path Signed-off-by: Masaki Kozuki <[email protected]> * add fr asr ckpt to doc (#3809) Signed-off-by: Yang Zhang <[email protected]> * bert_module: fix inputs of exported model (#3815) Exported ONNX model had to be passed with "attention_mask" and "token_type_ids" inputs swapped to get correct output. Changing the input order fixes this issue. Also return a dict instead of list for correctly passing inputs. Update relevant tests: test_TokenClassificationModel_export_to_onnx test_PunctuationCapitalizationModel_export_to_onnx test_QAModel_export_to_onnx Signed-off-by: Viraj Karandikar <[email protected]> Co-authored-by: Yang Zhang <[email protected]> * update random batch sampler Signed-off-by: Masaki Kozuki <[email protected]> Co-authored-by: Yang Zhang <[email protected]> Co-authored-by: Viraj Karandikar <[email protected]> Co-authored-by: Eric Harper <[email protected]>

Exported ONNX model had to be passed with "attention_mask" and "token_type_ids" inputs swapped to get correct output. Changing the input order fixes this issue. Also return a dict instead of list for correctly passing inputs. Update relevant tests: test_TokenClassificationModel_export_to_onnx test_PunctuationCapitalizationModel_export_to_onnx test_QAModel_export_to_onnx Signed-off-by: Viraj Karandikar <[email protected]> Co-authored-by: Yang Zhang <[email protected]>

…nto account (#3826) * initial fix: taking data parallel size into consideration Signed-off-by: Masaki Kozuki <[email protected]> * update the signature Signed-off-by: Masaki Kozuki <[email protected]> * data_parallel_rank -> data_parallel_size Signed-off-by: Masaki Kozuki <[email protected]> * fix typo Signed-off-by: Masaki Kozuki <[email protected]> * cosmetic changes Signed-off-by: Masaki Kozuki <[email protected]> * Revert "update the signature" This reverts commit 1c134e5. Signed-off-by: Masaki Kozuki <[email protected]> * update batch_sampler arguments Signed-off-by: Masaki Kozuki <[email protected]> * change how to slice `batch` Signed-off-by: Masaki Kozuki <[email protected]> * update not drop_last path Signed-off-by: Masaki Kozuki <[email protected]> * add fr asr ckpt to doc (#3809) Signed-off-by: Yang Zhang <[email protected]> * bert_module: fix inputs of exported model (#3815) Exported ONNX model had to be passed with "attention_mask" and "token_type_ids" inputs swapped to get correct output. Changing the input order fixes this issue. Also return a dict instead of list for correctly passing inputs. Update relevant tests: test_TokenClassificationModel_export_to_onnx test_PunctuationCapitalizationModel_export_to_onnx test_QAModel_export_to_onnx Signed-off-by: Viraj Karandikar <[email protected]> Co-authored-by: Yang Zhang <[email protected]> * update random batch sampler Signed-off-by: Masaki Kozuki <[email protected]> Co-authored-by: Yang Zhang <[email protected]> Co-authored-by: Viraj Karandikar <[email protected]> Co-authored-by: Eric Harper <[email protected]>

virajkarandikar force-pushed the vkarandikar/fix_bert_model_inputs branch from eaca787 to 2cd9f4f Compare March 9, 2022 11:34

virajkarandikar changed the title ~~Draft: bert_module: fix model inputs when exported~~ bert_module: fix model inputs when exported Mar 9, 2022

virajkarandikar force-pushed the vkarandikar/fix_bert_model_inputs branch 2 times, most recently from 2bf0c12 to dcac1ab Compare March 9, 2022 18:44

virajkarandikar marked this pull request as ready for review March 9, 2022 18:45

virajkarandikar force-pushed the vkarandikar/fix_bert_model_inputs branch from dcac1ab to 8d275c7 Compare March 9, 2022 18:47

virajkarandikar changed the title ~~bert_module: fix model inputs when exported~~ bert_module: fix inputs of export model Mar 9, 2022

virajkarandikar force-pushed the vkarandikar/fix_bert_model_inputs branch from 8d275c7 to 079e148 Compare March 9, 2022 18:48

Merge branch 'main' into vkarandikar/fix_bert_model_inputs

7da2e04

yzhang123 approved these changes Mar 11, 2022

View reviewed changes

borisfom approved these changes Mar 11, 2022

View reviewed changes

borisfom merged commit 3dd8a5c into NVIDIA:main Mar 11, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bert_module: fix inputs of export model #3815

bert_module: fix inputs of export model #3815

virajkarandikar commented Mar 9, 2022 •

edited

Loading

ryanleary commented Mar 10, 2022

yzhang123 commented Mar 11, 2022

bert_module: fix inputs of export model #3815

bert_module: fix inputs of export model #3815

Conversation

virajkarandikar commented Mar 9, 2022 • edited Loading

What does this PR do ?

Changelog

Usage

Before your PR is "Ready for review"

Who can review?

Additional Information

ryanleary commented Mar 10, 2022

yzhang123 commented Mar 11, 2022

virajkarandikar commented Mar 9, 2022 •

edited

Loading