Add decorator for custom op and inductor decomp registration #434
Conversation
See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/434
✅ No failures as of commit 320b846 with merge base c2cf973. (This comment was automatically generated by Dr. CI and updates every 15 minutes. Links to docs will display an error until the docs builds have completed.)
quant_min: Optional[int] = None,
quant_max: Optional[int] = None,
zero_point_domain: str = "INT",
*,
Remove the *, it's not in the schema
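For context on this suggestion: a bare `*` in a Python signature makes every parameter after it keyword-only, which the op schema (as written) does not express. A minimal illustration, with `output_dtype` as a made-up keyword-only parameter, not one from the actual op:

```python
from typing import Optional

def quantize(
    quant_min: Optional[int] = None,
    quant_max: Optional[int] = None,
    zero_point_domain: str = "INT",
    *,                           # everything after this is keyword-only
    output_dtype: str = "int8",  # hypothetical parameter for illustration
):
    return (quant_min, quant_max, zero_point_domain, output_dtype)

# Positional calls work up to the `*`:
quantize(0, 255, "INT")

# Passing the keyword-only argument positionally raises TypeError:
try:
    quantize(0, 255, "INT", "int8")
except TypeError:
    pass  # expected: output_dtype must be passed by keyword
```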
if TORCH_VERSION_AFTER_2_5:
    # TODO: change order
    lib_namespace = lib.ns
    op_name = schema.split("(")[0]
Maybe construct a schema object from the string and query the op name? I thought such functionality existed, but I'm not sure.
oh, not sure if this is possible, cc @zou3519, is there a better way to get op_name here?
torch._C.parse_schema will give you a FunctionSchema object
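As a torch-free sketch of what the merged code does instead: the op name is taken to be everything in the schema string before the first `(`. (Parsing via torch._C.parse_schema, as suggested above, would give a structured FunctionSchema rather than relying on string splitting.)

```python
def op_name_from_schema(schema: str) -> str:
    # e.g. "quantize_affine(Tensor input, int[] block_size) -> Tensor"
    # -> "quantize_affine"; mirrors the schema.split("(")[0] line in the PR
    return schema.split("(")[0]

print(op_name_from_schema("quantize_affine(Tensor input) -> Tensor"))
# -> quantize_affine
```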
I just used fn.__name__[1:] for now
after_export = model(x)
self.assertTrue(torch.equal(after_export, ref))
if api is _int8da_int8w_api:
What is this checking?
this is because right now we will only see these ops for int8da_int8w quantization; other types of quant (e.g. int4 weight-only) call into the efficient kernels directly
we should probably figure out a path for executorch. I think we could abstract this with "layout", what would be a good name here?
# expecting fn.__name__ starts with `_` and we want to take the rest
# to be the name of the custom op
assert fn.__name__[0] == "_", f"Expecting function name starts with `_`, got {fn.__name__}"
op_name = fn.__name__[1:]
Can you assert there is no ".", "<", or ">" in fn.__name__? This can happen with lambdas or local functions.
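To illustrate the failure mode being guarded against: a lambda's __name__ is the literal string "<lambda>", and a nested function's __qualname__ picks up "<locals>" segments, so deriving an op name from the function name needs a sanity check. The check_op_name helper below is a hypothetical guard along the lines of this suggestion, not the actual torchao code:

```python
def outer():
    # a lambda defined inside a function
    return lambda x: x

f = outer()
print(f.__name__)      # -> <lambda>
print(f.__qualname__)  # -> outer.<locals>.<lambda>

def check_op_name(fn):
    # hypothetical guard: reject names that cannot serve as custom op names
    name = fn.__name__
    assert not any(c in name for c in ".<>"), (
        f"Expecting a plain function name usable as a custom op name, got {name}"
    )

check_op_name(outer)   # fine: "outer" contains no forbidden characters
```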
lgtm from custom ops perspective
Summary:
This PR adds a decorator that registers a custom op and also an inductor decomposition.
The goal is for the torch.export path to be able to see high-level ops like quantize_affine instead of their decompositions, because some backends like xnnpack want to work with these higher-level ops.
This is a redo of #408; the difference is that in this PR we can preserve the enums on the Python side.
Test Plan:
regression tests:
python test/quantization/test_quant_api.py
python test/integration/test_integration.py
also need to check performance with python tutorials/quantize_vit/run_vit_b_quant.py