[QUANTIZE] Memorizing the quantize node mapping #3233
Conversation
Fusion of resnet-50 is broken after this PR.
If simulated_quantize is not memoized, there will be two simulated_quantize nodes following the first conv2d.
CSE will merge both the i8 cast and stop_fusion, but not the i32 cast:
In this way we ensure that the output is int8. However, with simulated_quantize memoized, the two i32 casts will be merged because of memoization (we call …
@vinx13 It would be great if we can find alternatives. Ideally, we want to handle both cases.
@vinx13 I may be missing the point. Does it matter that those two i32 casts are merged? The main residual block will still output 8-bit (in front of the …
@ZihengJiang It will impact performance. Although stop_fusion can ensure that conv2d + fused ops produce an int8 result, if the int32 casts are merged, the cast will be put into a separate sub-function …
@ZihengJiang @vinx13 Can you please follow up now that #3280 is merged?
@ZihengJiang Please rebase against master.
* [QUANTIZE] Support for clip operator
* [QUANTIZE] Memorizing the quantize node mapping.
* [QUANTIZE] Remove use_stop_fusion and skip_k_conv in qconfig
* update
* update
* update
* update
To avoid duplicated simulated_quantize nodes.
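To illustrate the idea being discussed, here is a minimal, hypothetical sketch of memoizing an annotation rewrite. It is not TVM's actual pass: `Expr`, `annotate`, and the node names are stand-ins invented for this example. The point is only that caching the rewritten node per input expression means a value consumed by several ops gets one shared simulated_quantize node instead of one per consumer.

```python
class Expr:
    """Minimal stand-in for an IR expression node (hypothetical)."""
    def __init__(self, op, *args):
        self.op, self.args = op, args

_memo = {}  # maps original node id -> its rewritten node

def annotate(node):
    """Rewrite a graph, inserting simulated_quantize after conv2d.

    The memo table guarantees each original node is rewritten once,
    so all consumers share the same rewritten node.
    """
    key = id(node)
    if key in _memo:                 # memo hit: reuse the existing rewrite
        return _memo[key]
    new_args = [annotate(a) for a in node.args]
    if node.op == "conv2d":          # insert the simulated quantize op
        out = Expr("simulated_quantize", Expr("conv2d", *new_args))
    else:
        out = Expr(node.op, *new_args)
    _memo[key] = out
    return out

# A conv2d whose output feeds two consumers, as in a residual block:
conv = Expr("conv2d", Expr("data"))
graph = Expr("add", Expr("relu", conv), conv)
result = annotate(graph)

# Both consumers share one simulated_quantize node rather than getting
# a duplicate each:
assert result.args[0].args[0] is result.args[1]
```

Without the memo table, each call to `annotate(conv)` would build a fresh `simulated_quantize` node, which is the duplication the PR description refers to.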
@tqchen @vinx13 @jwfromm