[Bugfix][Arith] Avoid flaky test with TryCombineSplitFromSameSource #15128

Lunderberg · 2023-06-20T17:27:08Z

This commit resolves a flaky test failure that was introduced in #15081. The unit test `tests/python/unittest/test_meta_schedule_schedule_rule_mlt_tc.py::test_padded_matmul_relu` failed approximately 30% of the time.

The error was due to changes in `IterMapRewriter::NormalizeToIterWithOffset`. When attempting a simplification with `TryCombineSplitFromSameSource`, if the returned `Optional<IterSumExpr>` is defined but the `args.size() < 1` check fails, the `expr` argument is still overwritten, which can presumably cause a later `TryFuseIters(expr)` to fail.

This commit replaces `expr = opt.value();` with `auto combined = opt.value();`, preserving the original argument.
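The control-flow difference described above can be sketched in Python. This is only an analogy, not the TVM source: the function names, the list-of-strings stand-in for `IterSumExpr`, and the three toy callbacks are all hypothetical, chosen so that the combine step succeeds but its result is rejected by the follow-up check.

```python
from typing import Callable, List, Optional

# Toy stand-in for IterSumExpr: a list of argument names (hypothetical).
Expr = List[str]


def normalize_overwriting(
    expr: Expr,
    try_combine: Callable[[Expr], Optional[Expr]],
    accept: Callable[[Expr], bool],
    try_fuse: Callable[[Expr], Expr],
) -> Expr:
    """Reassigns the argument before checking whether the result is usable."""
    combined = try_combine(expr)
    if combined is not None:
        expr = combined  # the original argument is lost here
        if accept(expr):
            return expr
    # If accept() failed, the fallback sees the combined form, not the input.
    return try_fuse(expr)


def normalize_preserving(
    expr: Expr,
    try_combine: Callable[[Expr], Optional[Expr]],
    accept: Callable[[Expr], bool],
    try_fuse: Callable[[Expr], Expr],
) -> Expr:
    """Keeps the combined result in a separate name, so expr is untouched."""
    combined = try_combine(expr)
    if combined is not None and accept(combined):
        return combined
    return try_fuse(expr)


# Hypothetical callbacks: combining merges the first two arguments,
# the check only accepts a single-argument result, and the fallback
# returns its input unchanged so we can see what it received.
def toy_combine(expr: Expr) -> Optional[Expr]:
    return [expr[0] + expr[1]] + expr[2:] if len(expr) >= 2 else None


def toy_accept(expr: Expr) -> bool:
    return len(expr) <= 1


def toy_fuse(expr: Expr) -> Expr:
    return list(expr)
```

With the input `["a", "b", "c"]`, the overwriting variant hands the partially combined expression to the fallback, while the preserving variant hands it the original. Which of the two behaviors the rest of the rewriter actually needs is a separate question, and the sketch does not answer it.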

tvm-bot · 2023-06-20T17:27:11Z

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

cc @junrushao (See #10317 for details)

Generated by tvm-bot

junrushao

Thanks for the quick fix!

Lunderberg · 2023-06-20T19:03:46Z

No problem! Once I had it bisected down, it was relatively quick to apply a fix as well.

Lunderberg · 2023-06-20T19:47:05Z

And I spoke too soon. The original "fix" also prevents several simplifications that are needed for index simplification.

tqchen · 2023-06-20T20:30:06Z

Interesting. @Lunderberg, can you send the simplification formula around the regression? I can take a look as well. Ideally the result should be deterministic, and it seems that right now it is not.

tqchen · 2023-06-20T23:22:55Z

I took a look and sent in a fix here: #15131

Lunderberg · 2023-06-21T00:22:00Z

I'm not sure at the moment which expression is failing to simplify on main. For some of the choices of split factors, it resulted in the buffer C not being recognized as a region cover. But I didn't quite get to the point of narrowing it down to a specific expression before the end of the day.

The buggy fix in this PR broke simplification of `n * (x//n) + x%n => x`. Closing this PR in favor of your fix.
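As a quick sanity check of the identity mentioned above, the following Python sketch brute-forces `n * (x//n) + x%n == x` over a small range. It relies on Python's floor-division semantics for `//` and `%`, which I am assuming match the floordiv/floormod meaning of the expression in the comment; the helper name is hypothetical.

```python
def split_and_recombine(x: int, n: int) -> int:
    """Split x into quotient and remainder by n, then recombine them."""
    return n * (x // n) + x % n


def identity_holds(max_n: int = 16, max_abs_x: int = 64) -> bool:
    """Check the identity for every positive n up to max_n and a range of x."""
    return all(
        split_and_recombine(x, n) == x
        for n in range(1, max_n + 1)
        for x in range(-max_abs_x, max_abs_x + 1)
    )
```

This only confirms that the rewrite is numerically valid, so a simplifier that fails to apply it is missing an optimization rather than guarding against an incorrect result.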

Lunderberg · 2023-06-21T13:33:37Z

For completeness, it looks like the failure can be traced back to the call to `EstimateRegionLowerBound` that occurs here, which returns `NullOpt` with the following arguments.

x = ax0_ax1_ax3_ax4_ax5_fused
y = ax0_0_0_ax1_0_0_fused

var_dom = {x: T.Range(0, 2048)}
indices = [x // 1024, y * 2 + x % 1024 // 512, ax2, x % 512 // 256, x % 256 // 16, x % 16]
predicate = ((x // 1024 * 64 + ax2 * 16 + x % 256 // 16 < 127) and
             (y * 64 + x % 1024 // 512 * 32 + x % 512 // 256 * 16 + x % 16 < 127))
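The arguments above can be transcribed into plain Python to explore them by hand. This is a sketch for experimentation only, not a reproduction of `EstimateRegionLowerBound`: the function names are hypothetical, and the ranges of `y` and `ax2` are not given in the comment, so any values passed for them are assumptions. The last helper checks that the five `x`-derived terms are splits of the same source, so they recombine to `x` over the stated domain `[0, 2048)`.

```python
from typing import List


def indices(x: int, y: int, ax2: int) -> List[int]:
    """Index expressions copied from the comment above."""
    return [x // 1024, y * 2 + x % 1024 // 512, ax2,
            x % 512 // 256, x % 256 // 16, x % 16]


def predicate(x: int, y: int, ax2: int) -> bool:
    """Predicate copied from the comment above."""
    return ((x // 1024 * 64 + ax2 * 16 + x % 256 // 16 < 127) and
            (y * 64 + x % 1024 // 512 * 32 + x % 512 // 256 * 16 + x % 16 < 127))


def recombine_x(x: int) -> int:
    """Rebuild x from its five splits, weighting each by its lower factor."""
    return ((x // 1024) * 1024 + (x % 1024 // 512) * 512 +
            (x % 512 // 256) * 256 + (x % 256 // 16) * 16 + x % 16)


def splits_cover_domain() -> bool:
    """True if the splits recombine to x for every x in var_dom."""
    return all(recombine_x(x) == x for x in range(2048))
```

For example, `predicate(2047, 1, 3)` evaluates the first clause as `64 + 48 + 15 = 127`, which is not below 127, so that point is excluded (here `y = 1` and `ax2 = 3` are assumed values, not taken from the thread).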

tqchen · 2023-06-21T13:46:25Z

It might be useful for us to send in a regression test.

Lunderberg requested a review from tqchen June 20, 2023 17:27

Lunderberg mentioned this pull request Jun 20, 2023

[TIR] Handle DeclBuffer in CacheReadWrite schedule primitive #15037

Merged

junrushao approved these changes Jun 20, 2023

Lunderberg closed this Jun 21, 2023

Lunderberg deleted the arith_affine_bugfix branch June 21, 2023 13:21
