Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Relay][Quantization] Speed-aware quantization scheme improvement #2723

Merged
merged 4 commits into from
Mar 9, 2019

Conversation

vinx13
Copy link
Member

@vinx13 vinx13 commented Mar 4, 2019

Writing int32 result to global memory can be much slower than int8. This PR does the following change:

  • in add_rewrite, quantize rhs to int8 so that read/write of rhs can be performed in int8.
  • In UnifyDtypeScale, if the input is simulated_quantize(QInput), cast the input to int8 before casting to int32.

@ZihengJiang

@ZihengJiang ZihengJiang self-assigned this Mar 5, 2019
tqchen
tqchen previously requested changes Mar 5, 2019
src/relay/pass/quantize.cc Show resolved Hide resolved
@ZihengJiang
Copy link
Contributor

for the comment, I mean to explain the code like here

@ZihengJiang
Copy link
Contributor

please fixed the CI @vinx13

@ZihengJiang ZihengJiang merged commit 21e8dfa into apache:master Mar 9, 2019
@ZihengJiang
Copy link
Contributor

Merged, thanks! @vinx13

wweic pushed a commit to neo-ai/tvm that referenced this pull request Mar 9, 2019
…ache#2723)

* [Relay][Quantization] Speed-aware quantization scheme improvement

* Add comment

* Add use_stop_fusion to qconfig

* Update comment
wweic pushed a commit to neo-ai/tvm that referenced this pull request Mar 12, 2019
…ache#2723)

* [Relay][Quantization] Speed-aware quantization scheme improvement

* Add comment

* Add use_stop_fusion to qconfig

* Update comment
wweic pushed a commit to neo-ai/tvm that referenced this pull request Mar 12, 2019
…ache#2723)

* [Relay][Quantization] Speed-aware quantization scheme improvement

* Add comment

* Add use_stop_fusion to qconfig

* Update comment
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants