
Fold "FP16 weights -> Dequantize -> FP32 -> Conv/MatMul..." to "FP32 weights -> Conv/MatMul..." #35

Closed
zhenhuaw-me opened this issue Dec 15, 2020 · 7 comments · Fixed by #52
Labels
Enhancement New feature or request Quantization Story How we evolve

Comments

@zhenhuaw-me
Owner

ONNX quantization doesn't treat FP16 as a quantized data type, so nearly all FP16-quantized TFLite models are unsupported (see this FAQ).

We recommend that users switch to full integer quantization. But if that is not an option (for example, no TensorFlow model is available), we can fold "FP16 weights -> Dequantize -> FP32 -> Conv/MatMul..." into "FP32 weights -> Conv/MatMul..." to work around this issue.
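For reference, FP16 "dequantization" is just an upcast to FP32, so folding it into the weights is numerically lossless. A minimal NumPy sketch of the idea (the weight values here are illustrative, not from a real model):

```python
import numpy as np

# Hypothetical FP16 weights as stored in a TFLite flatbuffer.
fp16_weights = np.array([0.5, -1.25, 3.0], dtype=np.float16)

# What the Dequantize operator does at runtime: FP16 -> FP32.
dequantized_at_runtime = fp16_weights.astype(np.float32)

# The proposed fold performs the same cast offline, so the exported ONNX
# graph sees plain FP32 weights and needs no Dequantize node.
folded_weights = fp16_weights.astype(np.float32)

assert np.array_equal(dequantized_at_runtime, folded_weights)
```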

@paulgavrikov
Contributor

It would be awesome if you could implement this! Unfortunately, the MediaPipe models often rely on FP16 weights.

@ram95014

ram95014 commented Jan 13, 2021

I would like to second @paulgavrikov and request the fold from FP16 to FP32. Thanks. If that is not possible or would take a long time, it would be great if you could guide me through doing it myself.

@zhenhuaw-me
Owner Author

zhenhuaw-me commented Jan 13, 2021

@paulgavrikov @ram95014 Thanks for your feedback! I'm very glad to see the interest. This should be possible and should not take too much time, but I haven't been working on it. It would be great if you could help!

In general, it could be divided into three steps:

  1. Parse and build the graph as if ONNX supported FP16 as a quantization type. At this stage the graph will contain many patterns like FP16 weights -> Dequantize -> FP32 Tensor -> Conv/MatMul, which would be illegal if converted to ONNX directly. I think the current code can already handle this stage, but I have not tried it locally.
  2. Walk the graph and fold the FP16 weights. We can introduce a new pass, similar to layout propagation or quantization handling, maybe named something like foldFP16Weights (a rough sketch of this pass follows the list). Search for the FP16 weights -> Dequantize -> FP32 Tensor -> Conv/MatMul pattern in the graph (iterate over the operators and check); for each matched pattern P:
    1. Cast the FP16 weight tensor to FP32 in place, updating Tensor.data, Tensor.dtype, etc.
    2. Detach the FP16 weights (now actually FP32) from the Dequantize operator, and the intermediate FP32 tensor from the Conv/MatMul operator.
    3. Attach the cast weights (now FP32) to the Conv/MatMul operator.
    4. Remove the Dequantize operator and the intermediate FP32 tensor from the graph.
  3. Recollect the tensors and operators, as the other graph passes do.
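As a rough Python sketch of step 2, assuming a simple in-memory graph with `Operator` and `Tensor` objects; the attribute and method names here (`graph.operators`, `tensor.consumers`, `tensor.is_constant`, and the pass name `fold_fp16_weights`) are illustrative assumptions, not the actual tflite2onnx API:

```python
import numpy as np

def fold_fp16_weights(graph):
    """Illustrative pass: fold FP16 weights -> Dequantize -> FP32 tensor -> Conv/MatMul
    into FP32 weights -> Conv/MatMul. Names are hypothetical, not the tflite2onnx API."""
    # Iterate over a copy so we can remove operators while walking the graph.
    for dequant in [op for op in graph.operators if op.type == 'DEQUANTIZE']:
        weights = dequant.inputs[0]      # stored FP16 constant tensor
        fp32_out = dequant.outputs[0]    # intermediate FP32 tensor
        if weights.dtype != np.float16 or not weights.is_constant:
            continue

        # Step 2.1: cast the stored data in place, FP16 -> FP32.
        weights.data = weights.data.astype(np.float32)
        weights.dtype = np.float32

        # Steps 2.2/2.3: rewire every consumer of the Dequantize output
        # (e.g. Conv or MatMul) to read the cast weights directly.
        for consumer in list(fp32_out.consumers):
            idx = consumer.inputs.index(fp32_out)
            consumer.inputs[idx] = weights

        # Step 2.4: drop the Dequantize operator and its output tensor.
        graph.operators.remove(dequant)
        graph.tensors.remove(fp32_out)
```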

I would suggest starting by adding a new operator to tflite2onnx to get familiar with the code. Operators are pretty easy to add; you can find examples in merged PRs.

It would be great if we could get this into the next minor release. Let's prioritize it!

@zhenhuaw-me linked a pull request Jan 16, 2021 that will close this issue
@zhenhuaw-me
Owner Author

@paulgavrikov @ram95014 This functionality has been enabled; please try it out with the latest code.
If anything looks wrong, please open a new issue and link it to this one. Thanks!

@mikkelmedm

Hi, is this supposed to be fixed already? I am still having issues.

@zhenhuaw-me
Owner Author

@mikkelmedm It has been fixed and is covered by this test. What error are you seeing?

@mikkelmedm

Running it on a MediaPipe model, I get an error like "FP16 is not tested, and might not work properly".
