[Relay, Quantization, TOPI] int8 dense on CUDA & Dense op quantization #2877

vinx13 · 2019-03-22T07:52:56Z

Quantize dense layers in quantization pass
Add out_dtype to DenseAttrs to support mixed precision.
Add Int8 dense schedule on CUDA

@eqy @icemelon9 @ZihengJiang

tqchen · 2019-03-27T01:20:42Z

cc @masahi @nishi-t @kazum @ajtulloch can you help review this PR?

vinx13 · 2019-04-09T02:31:25Z

@merrymercy @yzhliu could you also help review?

python/tvm/relay/quantize/quantize.py

tqchen · 2019-04-18T22:15:29Z

@yzhliu, please https://docs.tvm.ai/contribute/code_review.html#approve-and-request-changes-explicitly

vinx13 · 2019-04-24T05:28:32Z

@ZihengJiang @tqchen can you also take a look?

python/tvm/relay/quantize/quantize.py

ZihengJiang · 2019-04-26T16:42:38Z

Merged, thanks for the hard-working

apache#2877) * Quantize dense layers * Add out_dtype arggument to dense; Add dense_int8 on CUDA * Add topi unittest of dense int8 * Fix relay * Fix topi integration * Fix quantization * Update dense_rewrite * Triger CI * Change qconfig quantize_dense to quantize_op * Fix * Remove quantize_op from qconfig

tqchen added the status: need review label Mar 24, 2019

tqchen assigned ZihengJiang Mar 31, 2019

vinx13 force-pushed the feature/quantize_dense branch 2 times, most recently from 190943a to 6a6082b Compare April 3, 2019 02:10

vinx13 mentioned this pull request Apr 9, 2019

add the dense op quantization support #2992

Closed

vinx13 added 7 commits April 11, 2019 14:02

Quantize dense layers

ed6da5c

Add out_dtype arggument to dense; Add dense_int8 on CUDA

b8e385e

Add topi unittest of dense int8

e685268

Fix relay

3ef73a2

Fix topi integration

c2b9ed7

Fix quantization

e66fb92

Update dense_rewrite

bbfa578

vinx13 force-pushed the feature/quantize_dense branch from 89f1154 to bbfa578 Compare April 11, 2019 06:03

Triger CI

21686a8

yzhliu reviewed Apr 13, 2019

View reviewed changes

python/tvm/relay/quantize/quantize.py Outdated Show resolved Hide resolved

vinx13 added 2 commits April 15, 2019 11:15

Change qconfig quantize_dense to quantize_op

8801cbd

Fix

029afb2

Merge branch 'master' into feature/quantize_dense

c50a375

yzhliu approved these changes Apr 23, 2019

View reviewed changes

ZihengJiang reviewed Apr 25, 2019

View reviewed changes

python/tvm/relay/quantize/quantize.py Outdated Show resolved Hide resolved

vinx13 force-pushed the feature/quantize_dense branch from 7f7dbdf to ba0709e Compare April 25, 2019 23:54

Remove quantize_op from qconfig

a4e26da

vinx13 force-pushed the feature/quantize_dense branch from ba0709e to a4e26da Compare April 25, 2019 23:55

ZihengJiang approved these changes Apr 26, 2019

View reviewed changes

ZihengJiang merged commit cc09497 into apache:master Apr 26, 2019

ZihengJiang added the status: accepted label Apr 26, 2019

ZihengJiang removed the status: need review label Apr 26, 2019

merrymercy mentioned this pull request May 2, 2019

[TOPI] Fix mali conv2d performance regression #3131

Merged

masahi mentioned this pull request May 5, 2019

[ROCm] Fix dense autotvm template registration #3136

Merged

tqchen mentioned this pull request Nov 8, 2019

[RELEASE][DRAFT] TVM v0.6 Release candidate #4259

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Relay, Quantization, TOPI] int8 dense on CUDA & Dense op quantization #2877

[Relay, Quantization, TOPI] int8 dense on CUDA & Dense op quantization #2877

vinx13 commented Mar 22, 2019

tqchen commented Mar 27, 2019 •

edited

Loading

vinx13 commented Apr 9, 2019

tqchen commented Apr 18, 2019

vinx13 commented Apr 24, 2019

ZihengJiang commented Apr 26, 2019

[Relay, Quantization, TOPI] int8 dense on CUDA & Dense op quantization #2877

[Relay, Quantization, TOPI] int8 dense on CUDA & Dense op quantization #2877

Conversation

vinx13 commented Mar 22, 2019

tqchen commented Mar 27, 2019 • edited Loading

vinx13 commented Apr 9, 2019

tqchen commented Apr 18, 2019

vinx13 commented Apr 24, 2019

ZihengJiang commented Apr 26, 2019

tqchen commented Mar 27, 2019 •

edited

Loading