Retain qnn input kernel scales #4292
Conversation
Overall LGTM. The only change I request is renaming to input_scale and kernel_scale. We have similar naming elsewhere, so it will be easy to read.
Sure. When I wrote it, I felt it better to keep the names distinct to indicate the difference between input_scale in Requantize and here, but I suppose `tensor` is superfluous to requirements. Ramana
Not sure why my testing didn't catch this - hopefully this lot satisfies the CI.
LGTM, with some comments.
All updates are now done; please review and merge as appropriate.
A gentle ping for a merge.
Only one nitpick. Otherwise looks good to me.
Whoops, now fixed.
LGTM
Ok, it looks like there is some more work to be done here because of other bits that have landed.
For the first problem, you can just delete the extra scale arguments from the attrs.
nn.conv2d does not contain input_scale and kernel_scale. We need to delete them when lowering to nn.conv2d.
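A minimal sketch of that lowering step, using plain Python dicts to stand in for Relay attrs (the helper name `lower_qnn_conv2d_attrs` and the dict representation are illustrative, not the actual TVM implementation):

```python
# Hypothetical sketch: when qnn.conv2d is lowered to nn.conv2d, the
# qnn-only attributes must be stripped, because nn.conv2d's attrs do
# not define them.
QNN_ONLY_ATTRS = {"input_scale", "kernel_scale",
                  "input_zero_point", "kernel_zero_point"}

def lower_qnn_conv2d_attrs(qnn_attrs):
    """Return only the attrs nn.conv2d understands."""
    return {k: v for k, v in qnn_attrs.items() if k not in QNN_ONLY_ATTRS}

qnn_attrs = {"strides": (1, 1), "padding": (0, 0),
             "input_scale": 0.5, "kernel_scale": 0.25,
             "input_zero_point": 0, "kernel_zero_point": 0}
print(lower_qnn_conv2d_attrs(qnn_attrs))  # only strides and padding remain
```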
Force-pushed from 80084dd to 46f3c23.
Ah thanks - I hadn't spotted that last night. Now rebased and repushed. Ramana
Now all done; it would be nice to merge.
Thanks @u99127 @anijain2305 |
* Add qnn conv2d attributes for input_tensor_scale and kernel_tensor_scale. The lowering in the tflite frontend loses the input_tensor_scale and the kernel_tensor_scale by multiplying them and putting the product into the Requantize operation. This means that any graph partitioning passes, or other passes that need to access this information, no longer have it available in the qnn dialect.
* Store input tensor scale and weight tensor scale for Dense as well. As for conv2d, the tflite frontend drops the input tensor scale and the weight tensor scale from the relay op. Store them as separate fields there.
* Fix unintentional tab.
* Rename input_tensor_scale to input_scale and kernel_tensor_scale to kernel_scale for conv2d.
* input_tensor_scale -> input_scale, weight_tensor_scale -> weight_scale.
* Rework dense testcase and use input_scale and kernel_scale.
* Be consistent in use of input_scale and kernel_scale values.
* Fix up qnn conv2d tests for input_scale and kernel_scale.
* Make pydoc identical between conv2d and dense for weight_tensor.
* Fix up conv2d parameters to be in the same order between C++ and Python.
* Fix ordering of parameters for dense.
* Add input_scale and output_scale to satisfy CI.
* Delete input_scale and kernel_scale. nn.conv2d does not contain input_scale and kernel_scale. We need to delete them when lowering to nn.conv2d.
* Add input_scale and kernel_scale for qnn.conv2d.
The QNN dialect loses the input tensor scale and the weight tensor scale too early. This inhibits work on integrating third-party codegen or libraries whose interfaces expect the input tensor scale and weight tensor scale.
This patch stack fixes it up for conv2d and Dense. I cannot see
any other operators affected yet.
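To illustrate the problem, here is a small numeric sketch (plain Python, not TVM code; the values are made up) of how folding the per-tensor scales into a single requantize multiplier destroys information that later passes might need:

```python
# Plain-Python illustration (not TVM code): the tflite frontend folded
# input_scale * kernel_scale into the Requantize multiplier, so only the
# product survived in the graph.
input_scale, kernel_scale, output_scale = 0.5, 0.25, 2.0

requantize_multiplier = input_scale * kernel_scale / output_scale

# A different (input_scale, kernel_scale) pair yields the same multiplier,
# so the individual factors cannot be recovered from the product:
assert requantize_multiplier == (0.25 * 0.5) / output_scale

# What this patch does instead: keep the scales as separate qnn.conv2d
# attributes, so graph partitioning passes and external codegen can
# still read both values.
qnn_conv2d_attrs = {"input_scale": input_scale, "kernel_scale": kernel_scale}
print(qnn_conv2d_attrs)
```

Retaining the scales as attributes is redundant for TVM's own lowering (which still folds them into Requantize), but it keeps the information available in the qnn dialect for anyone who needs it earlier.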
See here for more: https://discuss.tvm.ai/t/lowering-qnn-conv2d-tflite/4654/5
Ramana
@anijain2305 - Please review.