Skip to content

Commit fae3ac6

Browse files
cccclaipytorchbot
authored andcommitted
Document Quantizer API and precision options for MTK backend (#15091)
Added Quantizer API section with precision configuration example. (cherry picked from commit fc7d03b)
1 parent 468fa82 commit fae3ac6

File tree

1 file changed

+11
-0
lines changed

1 file changed

+11
-0
lines changed

docs/source/backends-mediatek.md

Lines changed: 11 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -38,6 +38,17 @@ cd executorch
3838
```
3939
The exported `.pte` file is saved in a directory named after the model.
4040

41+
### Quantizer API
42+
43+
Quantizer can be configured with different precision. We currently support A16W16, A16W8, A16W4, A8W8 and A8W4
44+
45+
The example code will be
46+
```python
47+
precision = "A16W16"
48+
quantizer = NeuropilotQuantizer()
49+
quantizer.setup_precision(getattr(Precision, precision))
50+
```
51+
4152
### Partitioner API
4253

4354
A list of CompileSpec is suppported by MediaTek backend:

0 commit comments

Comments
 (0)