Commit a34731b
authored
[ROCM] DP4A intrinsic support for TE/TIR (#11009)
* [ROCM] Support dp4a on AMDGPU by sdot4 intrinsic
commit 0225f2b
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 14 08:56:10 2022 +0900
share op strategy between cuda and rocm
commit 762c7e8
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 14 08:28:34 2022 +0900
fixed rocm batch_matmul strategy for mixed i8i8i32
commit ce53e8d
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 14 06:17:30 2022 +0900
add rocm sdot4 TIR intrin
commit f4562b9
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 14 06:03:44 2022 +0900
rocm sdot4 works
commit 6cc6280
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 14 05:32:07 2022 +0900
more wip
commit 0602f4a
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 14 03:47:37 2022 +0900
Squashed commit of the following:
commit 65b8bcf
Author: Masahiro Masuda <[email protected]>
Date: Wed Apr 13 20:36:49 2022 +0900
[WIP] adding DP4A support to rocm
commit 4f8f308
Author: Masahiro Masuda <[email protected]>
Date: Wed Apr 13 14:03:25 2022 +0900
Squashed commit of the following:
commit 1711be3
Author: Masahiro Masuda <[email protected]>
Date: Wed Apr 13 13:11:40 2022 +0900
fixed condition for real
commit 8a48fb5
Author: Masahiro Masuda <[email protected]>
Date: Wed Apr 13 09:57:42 2022 +0900
Revert "Skip applying sch_rule when both ann and sch_rule are defined"
This reverts commit 4915c6a.
commit daea033
Author: Masahiro Masuda <[email protected]>
Date: Mon Apr 11 09:31:05 2022 +0900
[Metaschedule] Support rocm and spirv
commit eb0cae2
Author: Masahiro Masuda <[email protected]>
Date: Wed Apr 13 07:25:04 2022 +0900
dp4a works
commit 4915c6a
Author: Masahiro Masuda <[email protected]>
Date: Wed Apr 13 06:13:45 2022 +0900
Skip applying sch_rule when both ann and sch_rule are defined
commit 7b3d71c
Author: Masahiro Masuda <[email protected]>
Date: Wed Apr 13 04:40:31 2022 +0900
fixed intrin description
commit 7666cd7
Author: Masahiro Masuda <[email protected]>
Date: Tue Apr 12 19:59:47 2022 +0900
add DP4A intrin
commit 7086bdb
Author: Masahiro Masuda <[email protected]>
Date: Tue Apr 12 19:03:44 2022 +0900
works
commit db34397
Author: Masahiro Masuda <[email protected]>
Date: Tue Apr 12 12:49:52 2022 +0900
more hack to tensorize loop mapping to make resnet50 e2e work
commit 2409674
Author: Masahiro Masuda <[email protected]>
Date: Mon Apr 11 13:40:59 2022 +0900
wip support pad + qnn.conv2d folding
commit 613cb7e
Author: Masahiro Masuda <[email protected]>
Date: Sun Apr 10 12:04:08 2022 +0900
hack to tensorize loop mapping to make conv2d work
commit 9e4f9df
Author: Masahiro Masuda <[email protected]>
Date: Sun Apr 10 11:34:13 2022 +0900
wrap tensorize with try/catch
commit d4b496d
Author: Masahiro Masuda <[email protected]>
Date: Sun Apr 10 11:33:39 2022 +0900
revert change in task_scheduler.cc
commit 476129b
Author: Masahiro Masuda <[email protected]>
Date: Sat Apr 9 05:54:10 2022 +0900
try / catch in ThreadedApply
commit d8226ff
Author: Masahiro Masuda <[email protected]>
Date: Fri Apr 8 17:17:59 2022 +0900
filter out invalid candidate
commit 2632899
Author: Masahiro Masuda <[email protected]>
Date: Fri Apr 8 10:09:48 2022 +0900
try graceful exit in parallel_for_dynamic
commit 9d6741c
Author: Masahiro Masuda <[email protected]>
Date: Fri Apr 8 09:35:51 2022 +0900
[QNN] Fix broadcast for invalid axis
commit 6ccde09
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 20:51:15 2022 +0900
refactor rewrite_tensorize
commit 2ce2066
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 20:48:17 2022 +0900
allow missing schedule_rule in post order apply
commit 3a69353
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 19:42:48 2022 +0900
refactor rewrite_tensorize
commit 43e0b2f
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 18:25:14 2022 +0900
rewrite_vnni -> rewrite_tensorize
commit 823797e
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 18:12:12 2022 +0900
VNNI -> WithIntrin
commit 4284a47
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 17:45:41 2022 +0900
introduce TileForIntrin
commit b87ef32
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 17:34:04 2022 +0900
move TilingwithTensorIntrin to auto_tensorize.cc
commit 2fc118b
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 17:28:45 2022 +0900
clean up headers
commit d8b2aa3
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 17:09:32 2022 +0900
clean up using namespace
commit eb05d25
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 17:03:05 2022 +0900
refactored init
commit 5e6b0a0
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 16:57:14 2022 +0900
compiled
commit 2b8c430
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 12:51:55 2022 +0900
wip MultiLevelTiling refactor
commit 7c21a9f
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 11:58:33 2022 +0900
function doc string not supported by tvmscript
commit 40f9742
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 11:56:45 2022 +0900
update vnni intrin name
commit 4814f82
Merge: e0c5eb8 07bbb38
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 11:44:47 2022 +0900
Merge branch 'tir-tensor-intrin' into auto-tensorize-vnni
commit 07bbb38
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 11:24:56 2022 +0900
more lint fix
commit 15e60b4
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 11:16:08 2022 +0900
black
commit 7a757fe
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 11:12:54 2022 +0900
pylint
commit 9a3e508
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 10:58:52 2022 +0900
simplify import
commit d8e43ec
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 10:52:50 2022 +0900
use vectorlow/high in arm intrin
commit 625cd27
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 10:34:57 2022 +0900
fixed offset factor
commit 69e72b6
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 10:12:02 2022 +0900
Add ARM intrin
commit 1351fde
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 08:27:27 2022 +0900
use buffer syntax sugar
commit 0ced85f
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 08:17:43 2022 +0900
rename vnni.py to x86.py
commit 38a5aca
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 07:24:44 2022 +0900
add VNNI unittest
commit 88b763e
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 07:10:06 2022 +0900
refactored existing test using VNNI intrin
commit 711a007
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 07:04:58 2022 +0900
[TIR] Add VNNI dot product intrinsic for TIR
commit e0c5eb8
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 11:42:26 2022 +0900
merge fix
commit b171748
Merge: 71fe3bd 82e152a
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 11:33:59 2022 +0900
Merge branch 'tir-tensor-intrin' into auto-tensorize-vnni
commit 71fe3bd
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 06:57:38 2022 +0900
move tensor intrin under tir
commit 0c51bad
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 06:12:39 2022 +0900
remove log
commit fed910e
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 06:11:22 2022 +0900
more revert
commit 7150aff
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 06:10:44 2022 +0900
revert stmt_functor change
commit 155107b
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 06:10:09 2022 +0900
refactored RewriteVNNI a bit
commit ca15255
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 05:41:13 2022 +0900
add RewriteVNNI
commit dc9f71d
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 05:38:56 2022 +0900
vectorized init loop
commit fcc31ee
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 04:55:36 2022 +0900
tensorize worked
commit 2b53437
Author: Masahiro Masuda <[email protected]>
Date: Wed Apr 6 19:11:05 2022 +0900
TilingwithTensorIntrin works
commit 86baa31
Author: Masahiro Masuda <[email protected]>
Date: Wed Apr 6 08:58:27 2022 +0900
Ported auto-tensorization code
commit 82e152a
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 11:24:56 2022 +0900
more lint fix
commit 88d9bdd
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 11:16:08 2022 +0900
black
commit 31fe7eb
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 11:12:54 2022 +0900
pylint
commit 7876754
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 10:58:52 2022 +0900
simplify import
commit 56f2e9a
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 10:52:50 2022 +0900
use vectorlow/high in arm intrin
commit 995cc8d
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 10:34:57 2022 +0900
fixed offset factor
commit 86bbd49
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 10:12:02 2022 +0900
Add ARM intrin
commit 120fd96
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 08:27:27 2022 +0900
use buffer syntax sugar
commit 0f0682d
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 08:17:43 2022 +0900
rename vnni.py to x86.py
commit f88c31e
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 07:24:44 2022 +0900
add VNNI unittest
commit 6cc8009
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 07:10:06 2022 +0900
refactored existing test using VNNI intrin
commit 11a29c7
Author: Masahiro Masuda <[email protected]>
Date: Thu Apr 7 07:04:58 2022 +0900
[TIR] Add VNNI dot product intrinsic for TIR
* cleanup
* black
* update dot prod intrin
* add mattr kind
* conv2d topi test working
* add dense and bmm test
* add conv2d relay test
* add tir intrin test
* pylint1 parent 3d63b2d commit a34731b
File tree
20 files changed
+358
-273
lines changed- python/tvm
- relay
- op/strategy
- qnn/op
- tir/tensor_intrin
- topi
- cuda
- rocm
- src/target
- tests/python
- relay
- topi/python
- unittest
20 files changed
+358
-273
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
145 | 145 | | |
146 | 146 | | |
147 | 147 | | |
148 | | - | |
| 148 | + | |
149 | 149 | | |
150 | 150 | | |
151 | 151 | | |
| |||
297 | 297 | | |
298 | 298 | | |
299 | 299 | | |
300 | | - | |
| 300 | + | |
301 | 301 | | |
302 | 302 | | |
303 | 303 | | |
| |||
376 | 376 | | |
377 | 377 | | |
378 | 378 | | |
379 | | - | |
| 379 | + | |
380 | 380 | | |
381 | 381 | | |
382 | 382 | | |
| |||
836 | 836 | | |
837 | 837 | | |
838 | 838 | | |
839 | | - | |
| 839 | + | |
840 | 840 | | |
841 | 841 | | |
842 | 842 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
17 | 17 | | |
18 | 18 | | |
19 | 19 | | |
20 | | - | |
21 | 20 | | |
22 | 21 | | |
23 | 22 | | |
24 | 23 | | |
25 | 24 | | |
26 | 25 | | |
27 | | - | |
| 26 | + | |
28 | 27 | | |
29 | 28 | | |
30 | 29 | | |
31 | 30 | | |
32 | 31 | | |
33 | | - | |
34 | | - | |
35 | | - | |
36 | 32 | | |
37 | 33 | | |
38 | | - | |
39 | | - | |
40 | 34 | | |
41 | | - | |
42 | | - | |
43 | | - | |
44 | | - | |
45 | | - | |
46 | | - | |
47 | | - | |
48 | | - | |
49 | | - | |
50 | | - | |
51 | | - | |
52 | | - | |
53 | | - | |
54 | | - | |
55 | | - | |
56 | | - | |
57 | | - | |
58 | | - | |
59 | | - | |
60 | | - | |
61 | | - | |
62 | | - | |
63 | | - | |
64 | | - | |
65 | | - | |
66 | | - | |
67 | | - | |
68 | | - | |
69 | | - | |
70 | | - | |
71 | | - | |
72 | | - | |
73 | | - | |
74 | | - | |
75 | | - | |
76 | | - | |
77 | | - | |
78 | 35 | | |
79 | | - | |
80 | | - | |
81 | | - | |
82 | | - | |
83 | | - | |
84 | | - | |
85 | | - | |
86 | | - | |
87 | | - | |
88 | | - | |
89 | | - | |
90 | | - | |
91 | | - | |
92 | | - | |
93 | | - | |
94 | | - | |
95 | | - | |
| 36 | + | |
96 | 37 | | |
97 | | - | |
98 | | - | |
99 | | - | |
100 | | - | |
101 | | - | |
102 | | - | |
103 | | - | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
104 | 52 | | |
105 | | - | |
106 | | - | |
107 | | - | |
108 | | - | |
109 | | - | |
110 | | - | |
111 | | - | |
112 | | - | |
113 | | - | |
114 | | - | |
115 | | - | |
116 | | - | |
117 | | - | |
118 | | - | |
119 | | - | |
120 | | - | |
121 | | - | |
122 | | - | |
123 | | - | |
124 | | - | |
125 | | - | |
126 | | - | |
127 | | - | |
128 | | - | |
129 | | - | |
130 | | - | |
131 | | - | |
132 | | - | |
133 | | - | |
134 | | - | |
135 | | - | |
136 | | - | |
137 | | - | |
138 | | - | |
139 | | - | |
140 | | - | |
141 | | - | |
142 | | - | |
143 | | - | |
144 | | - | |
145 | | - | |
146 | | - | |
147 | | - | |
148 | | - | |
149 | | - | |
150 | | - | |
151 | | - | |
152 | | - | |
153 | | - | |
154 | | - | |
155 | | - | |
156 | | - | |
157 | | - | |
158 | | - | |
159 | | - | |
160 | | - | |
161 | | - | |
162 | | - | |
163 | | - | |
164 | | - | |
165 | | - | |
166 | | - | |
167 | | - | |
168 | | - | |
169 | | - | |
170 | | - | |
171 | | - | |
172 | | - | |
173 | | - | |
174 | | - | |
175 | | - | |
176 | 53 | | |
177 | 54 | | |
178 | 55 | | |
179 | 56 | | |
180 | 57 | | |
181 | 58 | | |
182 | 59 | | |
183 | | - | |
184 | | - | |
185 | | - | |
186 | | - | |
187 | | - | |
188 | | - | |
| 60 | + | |
| 61 | + | |
189 | 62 | | |
190 | 63 | | |
191 | 64 | | |
| |||
200 | 73 | | |
201 | 74 | | |
202 | 75 | | |
203 | | - | |
204 | | - | |
205 | | - | |
206 | | - | |
207 | | - | |
208 | | - | |
209 | | - | |
| 76 | + | |
| 77 | + | |
210 | 78 | | |
211 | 79 | | |
212 | 80 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
24 | 24 | | |
25 | 25 | | |
26 | 26 | | |
| 27 | + | |
27 | 28 | | |
28 | 29 | | |
29 | 30 | | |
| |||
387 | 388 | | |
388 | 389 | | |
389 | 390 | | |
390 | | - | |
391 | | - | |
392 | | - | |
393 | | - | |
394 | | - | |
395 | | - | |
396 | | - | |
397 | | - | |
398 | | - | |
399 | | - | |
400 | | - | |
401 | | - | |
402 | 391 | | |
403 | 392 | | |
404 | 393 | | |
| |||
456 | 445 | | |
457 | 446 | | |
458 | 447 | | |
459 | | - | |
| 448 | + | |
460 | 449 | | |
461 | 450 | | |
462 | | - | |
| 451 | + | |
463 | 452 | | |
464 | 453 | | |
465 | 454 | | |
466 | 455 | | |
467 | 456 | | |
468 | 457 | | |
469 | 458 | | |
470 | | - | |
| 459 | + | |
471 | 460 | | |
472 | 461 | | |
473 | | - | |
| 462 | + | |
474 | 463 | | |
475 | 464 | | |
476 | | - | |
477 | 465 | | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
18 | 18 | | |
19 | 19 | | |
20 | 20 | | |
| 21 | + | |
| 22 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
| 55 | + | |
0 commit comments