[Flaky Test] Fixes flaky TensorRT Test #15014

perdasilva · 2019-05-21T06:22:26Z

Description

Related to #14978

The TensorRT test suite failed with:

======================================================================
FAIL: Run LeNet-5 inference comparison between MXNet and TensorRT.
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/usr/local/lib/python3.6/dist-packages/nose/case.py", line 198, in runTest
    self.test(*self.arg)
  File "/work/mxnet/tests/python/tensorrt/test_tensorrt_lenet5.py", line 100, in test_tensorrt_inference
    MXNet = %f, TensorRT = %f""" % (mx_pct, trt_pct)
AssertionError: Diff. between MXNet & TensorRT accuracy too high:
           MXNet = 99.050000, TensorRT = 99.060000

See logs for full details.

I've decreased the sensitivity of the test slightly and modified the error message a little for clarity.
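For readers without the diff at hand, the kind of check being loosened can be sketched as follows. This is a minimal illustration, not the code in `test_tensorrt_lenet5.py`: the helper name `assert_accuracy_close` and the tolerance values used below are hypothetical, chosen only to show how a small accuracy gap such as 99.05 vs. 99.06 passes or fails depending on the allowed gap.

```python
def assert_accuracy_close(mx_pct, trt_pct, max_gap_pct):
    """Raise AssertionError if two accuracy percentages differ by more than max_gap_pct.

    Hypothetical helper for illustration; names and thresholds are assumptions,
    not taken from the MXNet test suite.
    """
    gap = abs(mx_pct - trt_pct)
    if gap > max_gap_pct:
        # Report both accuracies, the observed gap, and the allowed gap
        # so a failure can be diagnosed from the message alone.
        raise AssertionError(
            "Accuracy gap between MXNet and TensorRT too high: "
            "MXNet = %f, TensorRT = %f, gap = %f, allowed = %f"
            % (mx_pct, trt_pct, gap, max_gap_pct)
        )
    return gap


# With a loose (assumed) tolerance, the values from the failure above pass.
assert_accuracy_close(99.05, 99.06, max_gap_pct=0.02)

# With a tight (assumed) tolerance, the same values fail.
try:
    assert_accuracy_close(99.05, 99.06, max_gap_pct=0.005)
except AssertionError as err:
    print(err)
```

Including the observed and allowed gap in the message is one way to make such failures easier to read; the actual wording chosen in this PR may differ.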

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

The PR title starts with [MXNET-$JIRA_ID], where $JIRA_ID refers to the relevant JIRA issue created (except PRs with tiny changes)
Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage:
Unit tests are added for small changes to verify correctness (e.g. adding a new operator)
Nightly tests are added for complicated/long-running ones (e.g. changing distributed kvstore)
Build tests will be added for build configuration changes (e.g. adding a new build option with NCCL)
Code is well-documented:
For user-facing API changes, API doc string has been updated.
For new C++ functions in header files, their functionalities and arguments are documented.
For new examples, README.md is added to explain what the example does, the source of the dataset, expected performance on test set and reference to the original paper if applicable
Check the API doc at http://mxnet-ci-doc.s3-accelerate.dualstack.amazonaws.com/PR-$PR_ID/$BUILD_ID/index.html
To the best of my knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

karan6181 · 2019-05-21T19:48:28Z

@mxnet-label-bot add [Python, Test, pr-awaiting-review]

perdasilva · 2019-05-23T09:35:23Z

I've executed the test 10k times and didn't see the failure recur.
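A repeated-run check of this kind can be sketched as a small loop that counts assertion failures. This is only an illustration under stated assumptions: `count_failures` and `sometimes_fails` are hypothetical names, and the sketch does not claim to be how the 10k runs above were actually performed.

```python
def count_failures(test_fn, runs):
    """Call test_fn `runs` times and return how many calls raised AssertionError.

    Hypothetical helper for illustration; not part of the MXNet test suite.
    """
    failures = 0
    for _ in range(runs):
        try:
            test_fn()
        except AssertionError:
            failures += 1
    return failures


# Stand-in test that fails on every third call, to show the counting.
calls = {"n": 0}

def sometimes_fails():
    calls["n"] += 1
    if calls["n"] % 3 == 0:
        raise AssertionError("simulated flaky failure")

print(count_failures(sometimes_fails, runs=9))
```

Zero failures over many runs does not prove a test can never fail, but it is reasonable evidence that the failure rate has dropped.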

wkcn · 2019-05-24T03:21:09Z

Merged. Thank you for the fix!

perdasilva changed the title from "Fixes flaky TensorRT Test" to "[Flaky Test] Fixes flaky TensorRT Test" on May 21, 2019

Decreases test sensitivity

ed07768

perdasilva force-pushed the fix_flaky_tensorrt_test branch from f7cc87d to ed07768 Compare May 21, 2019 09:37

marcoabreu added the pr-awaiting-review, Python, and Test labels on May 21, 2019

This was referenced May 23, 2019

Disable flaky test test_tensorrt_lenet5 #15050

Closed

Flaky Test: test_tensorrt_lenet5.test_tensorrt_inference #14978

Closed

apeforest approved these changes May 23, 2019

wkcn merged commit 5763ba9 into apache:master May 24, 2019

perdasilva deleted the fix_flaky_tensorrt_test branch May 24, 2019 05:37

apeforest mentioned this pull request May 24, 2019

[MXNET-978] Support higher order gradient for log, log2, log10. #14992

Merged

7 tasks

haohuanw pushed a commit to haohuanw/incubator-mxnet that referenced this pull request Jun 23, 2019

Decreases test sensitivity (apache#15014)

a6eba89
