This repository has been archived by the owner on Nov 17, 2023. It is now read-only.

[v1.4.x] [MXNET-703] Update to TensorRT 5, ONNX IR 3. Fix inference bugs. #13897

Merged
marcoabreu merged 3 commits into apache:v1.4.x on Jan 17, 2019

Conversation

KellenSunderland (Contributor) commented Jan 16, 2019

Description

This PR updates the IR that is passed to TensorRT to use version 3 of the ONNX spec, which aligns much better with MXNet defaults and reduces boilerplate code. It also fixes some bugs when building inference engines that were producing feature vectors very different from the expected values.

Fixes #12598
Fixes #13113

Master PR with review history: #13310
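
For context, here is a minimal sketch (not code from this PR; it assumes the standalone onnx Python package and a hypothetical model.onnx file) of how a graph's IR version and opset imports can be inspected and pinned before the graph is handed to TensorRT:

```python
# Minimal illustrative sketch, not the PR's code.
# "model.onnx" is a hypothetical file used only for this example.
import onnx

model = onnx.load("model.onnx")
print("IR version:", model.ir_version)  # ONNX IR spec version, e.g. 3
for opset in model.opset_import:
    print("opset domain:", opset.domain or "ai.onnx", "version:", opset.version)

# Pin the graph to IR version 3 and re-validate it before export.
model.ir_version = 3
onnx.checker.check_model(model)
onnx.save(model, "model_ir3.onnx")
```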

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

  • The PR title starts with [MXNET-$JIRA_ID], where $JIRA_ID refers to the relevant JIRA issue created (except PRs with tiny changes)
  • Changes are complete (i.e. I finished coding on this PR)
  • All changes have test coverage:
  • Unit tests are added for small changes to verify correctness (e.g. adding a new operator)
  • Nightly tests are added for complicated/long-running ones (e.g. changing distributed kvstore)
  • Build tests will be added for build configuration changes (e.g. adding a new build option with NCCL)
  • Code is well-documented:
  • For user-facing API changes, API doc string has been updated.
  • For new C++ functions in header files, their functionalities and arguments are documented.
  • For new examples, a README.md is added to explain what the example does, the source of the dataset, expected performance on the test set, and a reference to the original paper if applicable
  • Check the API doc at http://mxnet-ci-doc.s3-accelerate.dualstack.amazonaws.com/PR-$PR_ID/$BUILD_ID/index.html
  • To the best of my knowledge, examples are either not affected by this change or have been fixed to be compatible with this change

Marco:
Resolves #13459

This works around a CUDA 10 cmake issue documented here:
clab/dynet#1457

This fix is temporary; once an updated cmake package is published to Ubuntu's
package repo it may be reverted.
@KellenSunderland KellenSunderland changed the title [MXNET-703] Update to TensorRT 5, ONNX IR 3. Fix inference bugs. [v1.4.x] [MXNET-703] Update to TensorRT 5, ONNX IR 3. Fix inference bugs. Jan 16, 2019
stu1130 (Contributor) commented Jan 16, 2019

@mxnet-label-bot add [pr-awaiting-review]
Thanks for the great work @KellenSunderland

@marcoabreu marcoabreu added the pr-awaiting-review PR is waiting for code review label Jan 16, 2019
KellenSunderland (Contributor, Author) commented:
@marcoabreu Would you be able to review this PR? It's the same as the other PR, but it makes a small change to the legacy Jenkinsfile to account for cmake builds.

@marcoabreu marcoabreu merged commit 9edf53a into apache:v1.4.x Jan 17, 2019
lanking520 pushed a commit to lanking520/incubator-mxnet that referenced this pull request Feb 18, 2019
…ugs. (apache#13897)

* [MXNET-703] Install CUDA 10 compatible cmake

This works around a CUDA 10 cmake issue documented here:
clab/dynet#1457

This fix is temporary; once an updated cmake package is published to Ubuntu's
package repo it may be reverted.

* [MXNET-703] Update to TensorRT 5 ONNX IR 3. Fix inference bugs.

* [MXNET-703] Describe onnx opsets and major version
lanking520 pushed a commit to lanking520/incubator-mxnet that referenced this pull request Apr 26, 2019
…ugs. (apache#13897)

* [MXNET-703] Install CUDA 10 compatible cmake

This works around a CUDA 10 cmake issue documented here:
clab/dynet#1457

This fix is temporary; once an updated cmake package is published to Ubuntu's
package repo it may be reverted.

* [MXNET-703] Update to TensorRT 5 ONNX IR 3. Fix inference bugs.

* [MXNET-703] Describe onnx opsets and major version