
fix flaky test: test_broadcast_binary_op #11875

Merged

Conversation

@azai91 (Contributor) commented Jul 24, 2018

Description

Fix the flaky test test_operator.py: test_broadcast_binary_op by casting inputs to float32.
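
For context, a minimal standalone sketch (not part of this PR's diff; the values are chosen purely for illustration) of why these checks are precision sensitive: as the in-diff comment later in this conversation notes, '%' is sensitive to the precision of the calculation, so evaluating the same remainder at different precisions can give different results, and a test that compares an mxnet result against a numpy reference at a mismatched precision can fail for unlucky random inputs.

import numpy as np

# Illustration only (arbitrary values): the same nominal inputs can give
# different remainders when the modulo is evaluated in float32 vs float64,
# which is why the test casts its inputs to a single dtype before comparing
# against the numpy reference.
a = np.float64(1e7) + np.float64(0.3)
b = np.float64(0.7)
r32 = np.float32(a) % np.float32(b)  # remainder computed in float32
r64 = a % b                          # remainder computed in float64
print(r32, r64)                      # the two results typically do not match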

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

  • The PR title starts with [MXNET-$JIRA_ID], where $JIRA_ID refers to the relevant JIRA issue created (except PRs with tiny changes)
  • Changes are complete (i.e. I finished coding on this PR)
  • All changes have test coverage:
  • Unit tests are added for small changes to verify correctness (e.g. adding a new operator)
  • Nightly tests are added for complicated/long-running ones (e.g. changing distributed kvstore)
  • Build tests will be added for build configuration changes (e.g. adding a new build option with NCCL)
  • Code is well-documented:
  • For user-facing API changes, API doc string has been updated.
  • For new C++ functions in header files, their functionalities and arguments are documented.
  • For new examples, README.md is added to explain what the example does, the source of the dataset, expected performance on the test set, and a reference to the original paper if applicable
  • Check the API doc at http://mxnet-ci-doc.s3-accelerate.dualstack.amazonaws.com/PR-$PR_ID/$BUILD_ID/index.html
  • To the best of my knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

Changes

  • [ ] Fix the flaky test test_operator.py: test_broadcast_binary_op by casting inputs to float32

Comments

  • If this change is a backward incompatible change, why must this change be made.
  • Interesting edge cases to note here

@azai91 (Contributor, Author) commented Jul 24, 2018

#11838 @larroy

@eric-haibin-lin (Member) commented:

Could you resolve conflicts?

@anirudh2290 added the pr-awaiting-response (PR is reviewed and waiting for contributor to respond), Flaky, and Test labels on Aug 9, 2018
@larroy (Contributor) commented Aug 17, 2018

Was this broken again? I had fixed it. Sorry, I have déjà vu now.

# doubles as well. This was a flaky test before when using float32. seed 1688524483, 1768433044
a = mx.sym.cast(a_, dtype='float64')
b = mx.sym.cast(b_, dtype='float64')
mx.sym.cast(b, dtype='float64')
@larroy (Contributor) commented Aug 17, 2018

Extra cast on line 1919?

@azai91 (Contributor, Author) commented Aug 17, 2018

test_bmod is still commented out as flaky.

@azai91 (Contributor, Author) commented Aug 17, 2018

You added the fix for test_binary_op; test_broadcast_binary_op has the same issue.

@larroy (Contributor) commented Aug 17, 2018

Got it 👍🏼 Can you remove the extra cast, or does it have a purpose / side effect?

@larroy (Contributor) commented Aug 17, 2018

👍🏼 (LGTM)

@larroy (Contributor) left a comment

👍🏼

a = mx.sym.cast(a_, dtype='float64')
b = mx.sym.cast(b_, dtype='float64')
# '%' is sensitive to the precision of the calculation. Force numpy to match mxnet's float32.
#check_binary_op_forward(c, lambda a, b: np.float32(a) % np.float32(b), gen_binary_data)

Does having the commented-out check_binary_op call here help the reader reason about the rest of the test? Is it needed?

@@ -1911,10 +1911,17 @@ def test_bdiv(a, b):
check_binary_op_forward(c, lambda a, b: a / b, gen_broadcast_data, mx_nd_func=mx.nd.divide)
check_binary_op_backward(c, lambda g_out, a, b: (g_out / b, - g_out * a / (b * b)), gen_broadcast_data)

def test_bmod(a, b):
def test_bmod(a_, b_):

Are you using a and a_ on purpose? It seems like sometimes you are using a and b, while other times you use a_ and b_.

@KellenSunderland (Contributor) commented Aug 20, 2018

I think a_ and b_ are placeholders here before the sym.cast operator is applied to them and they become a and b. This enforces that they'll be float64s before the broadcast_mod op is applied, which would otherwise cause numerical issues. Then we compare the result of the broadcast_mod (i.e. the c variable) with the python and imperative versions (which are both passed as lambdas to the check).
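
To make that concrete, a minimal sketch of the pattern being described (the variable names mirror the test, but this is an illustration rather than the exact test code):

import mxnet as mx

# a_ and b_ are the raw input symbols (placeholders); a and b are their
# float64-cast versions, so broadcast_mod is evaluated at a precision where
# the mxnet result and the numpy reference agree.
a_ = mx.sym.Variable('a')
b_ = mx.sym.Variable('b')
a = mx.sym.cast(a_, dtype='float64')
b = mx.sym.cast(b_, dtype='float64')
c = mx.sym.broadcast_mod(a, b)
# The forward output of c is then checked against the plain Python
# expression (lambda x, y: x % y) and the imperative mx.nd equivalent.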

@KellenSunderland (Contributor) left a comment

LGTM, thanks for addressing comments.

@eric-haibin-lin eric-haibin-lin merged commit c88b8ee into apache:master Aug 26, 2018
anirudh2290 pushed a commit to anirudh2290/mxnet that referenced this pull request Sep 19, 2018
* cast inputs to f32

* retrigger

* retrigger

* remove extra cast

* remove commented out function

* retrigger
Labels
Flaky, pr-awaiting-response (PR is reviewed and waiting for contributor to respond), Test

6 participants