[TOPI] add basic scheduling for conv2d_transpose on x86 #3491
Conversation
def traverse(op):
    """Traverse operators from computation graph"""
    # inline all one-to-one-mapping operators except the last stage (output)
    if tag.is_broadcast(op.tag) or tag.is_injective(op.tag):
`is_injective` subsumes `is_broadcast`, so checking both is redundant.
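For illustration, a minimal sketch of the collapsed check inside the usual TOPI traversal idiom; the wrapper function and its scaffolding are illustrative, not the PR's exact code:

```python
import tvm
from topi import tag

def _schedule_conv2d_transpose_sketch(s, outs):
    """Sketch: inline injective stages while walking back from the output."""
    def traverse(op):
        # In topi.tag, is_injective already returns True for elemwise and
        # broadcast tags, so a single check replaces the disjunction
        # `tag.is_broadcast(op.tag) or tag.is_injective(op.tag)`.
        if tag.is_injective(op.tag):
            if op not in s.outputs:
                s[op].compute_inline()
            for tensor in op.input_tensors:
                if isinstance(tensor.op, tvm.tensor.ComputeOp):
                    traverse(tensor.op)
    traverse(outs[0].op)
```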
def _declaration_conv2d_transpose(cfg, data, kernel, strides, padding, out_dtype):
    return _declaration_conv2d_transpose_impl(cfg, data, kernel, strides, padding, out_dtype)

def _declaration_conv2d_transpose_impl(cfg, data, kernel, strides, padding, out_dtype):
Can we put this function in nn/conv2d_transpose.py and have nn.conv2d_transpose_nchw and _declaration_conv2d_transpose share the implementation?
Makes sense. But I am assuming that cfg will be used within this function in the future. If we move it to nn/conv2d_transpose.py, I am not sure how to deal with it. What do you suggest? Thanks!
Anyway, I modified as you suggested and put a TODO for now.
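For context, a minimal sketch of how the x86 declaration can forward to the shared compute, assuming the pre-0.7 TOPI/AutoTVM API; the decorator arguments and the TODO placement are illustrative, not the PR's exact code:

```python
from tvm import autotvm
from topi import nn

# Sketch: the shared compute lives in topi/nn/conv2d_transpose.py as
# nn.conv2d_transpose_nchw; the x86 entry point just forwards to it.
@autotvm.register_topi_compute(nn.conv2d_transpose_nchw, 'cpu', ['direct'])
def _declaration_conv2d_transpose(cfg, data, kernel, strides, padding, out_dtype):
    # TODO: wire `cfg` into the declaration once an AutoTVM template exists
    return nn.conv2d_transpose_nchw(data, kernel, strides, padding, out_dtype)
```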
@yidawang do you folks happen to have an E2E example of Mask R-CNN with TVM? We'd be interested in this at FB for our object detector work.
Thanks @yidawang
* initialize conv2d transpose scheduling on x86
* refine the scheduler a bit
* fix for lint
* address review comments; remove duplicate code
* fix lint
As the title says. For the Mask R-CNN workload, throughput improves from 0.9 GFLOPS to 90 GFLOPS on an Amazon EC2 c5.18xlarge instance.
Finer tuning and AutoTVM support remain future work.
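For reference, a minimal sketch of exercising the new schedule through TOPI's generic dispatch under the pre-0.7 API; the shapes and target string are illustrative, not the benchmark setup used for the numbers above:

```python
import tvm
import topi

# Illustrative NCHW workload; real Mask R-CNN shapes will differ.
data = tvm.placeholder((1, 256, 32, 32), name='data')
kernel = tvm.placeholder((256, 256, 2, 2), name='kernel')

target = 'llvm -mcpu=skylake-avx512'  # e.g. an EC2 c5.18xlarge
with tvm.target.create(target):
    # Dispatches to the x86 compute/schedule registered by this PR.
    out = topi.nn.conv2d_transpose_nchw(data, kernel, strides=(2, 2),
                                        padding=(0, 0), out_dtype='float32')
    s = topi.generic.schedule_conv2d_transpose_nchw([out])
    func = tvm.build(s, [data, kernel, out], target=target)
```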
@yzhliu @kevinthesun