Move quant ops to utils.py #331

jerryzh168 · 2024-06-06T23:53:27Z

Summary:
We had a lot of "quant primitive" ops that can be expressed with more primitive ops, so these ops are more of a helper functions now, so we moved them to torchao.quantization.utils

we should be able to further deprecate some of the ops after we deprecate subclasses and refactor smoothquant etc. in the future

Also moved TORCH_VERSION_AFTER_{2_2/2_3/2_4} from torchao.quantization.utils to torchao.utils

After the move we have the following ops in quant_primitives.py:

    "safe_int_mm",
    "int_scaled_matmul",
    "choose_qparams_affine",
    "quantize_affine",
    "dequantize_affine",

Test Plan:
python test/integration/test_integration.py
python test/quantization/test_quant_api.py
python test/quantization/test_quant_primitives.py
python test/quantization/test_qat.py

Reviewers:

Subscribers:

Tasks:

Tags:

pytorch-bot · 2024-06-06T23:53:29Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/331

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

Rebase your PRs: Unstable CUDA signal in CI caused by cudnn 9 update

✅ No Failures

As of commit eebb7a2 with merge base 000a0fd ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

msaroufim · 2024-06-07T05:03:30Z

test/quantization/test_qat.py

@@ -18,8 +18,8 @@
    fake_quantize_per_channel_group,
    fake_quantize_per_token,
 )
-from torchao.quantization.quant_primitives import get_group_qparams_symmetric
-from torchao.quantization.utils import TORCH_VERSION_AFTER_2_4
+from torchao.quantization.utils import get_group_qparams_symmetric


I'm having trouble figuring out what should be in quant_primitives vs utils

functions in utils are mostly helper functions that calls the quant_primitive ops with some fixed parameters, e.g.

quantize_affine can support: symmetric/asymmetric, per tensor/group/channel etc., int8/int4/int3

helper function in utils can be: int8_symmetric_per_tensor_quant that calls quantize_affine op with fixed settings

test/integration/test_integration.py

torchao/utils.py

Summary: We had a lot of "quant primitive" ops that can be expressed with more primitive ops, so these ops are more of a helper functions now, so we moved them to torchao.quantization.utils we should be able to further deprecate some of the ops after we deprecate subclasses and refactor smoothquant etc. in the future Also moved TORCH_VERSION_AFTER_{2_2/2_3/2_4} from torchao.quantization.utils to torchao.utils Test Plan: python test/integration/test_integration.py python test/quantization/test_quant_api.py python test/quantization/test_quant_primitives.py Reviewers: Subscribers: Tasks: Tags:

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jun 6, 2024

jerryzh168 requested review from msaroufim, HDCharles and cpuhrsch June 6, 2024 23:53

jerryzh168 force-pushed the remove-ops branch 3 times, most recently from e58d0a6 to 6a9a5f4 Compare June 7, 2024 02:22

msaroufim reviewed Jun 7, 2024

View reviewed changes

jerryzh168 requested a review from msaroufim June 7, 2024 17:19

msaroufim reviewed Jun 7, 2024

View reviewed changes

test/integration/test_integration.py Outdated Show resolved Hide resolved

msaroufim reviewed Jun 7, 2024

View reviewed changes

torchao/utils.py Outdated Show resolved Hide resolved

jerryzh168 force-pushed the remove-ops branch from 6a9a5f4 to 4b9ed66 Compare June 7, 2024 18:20

jerryzh168 force-pushed the remove-ops branch from 4b9ed66 to eebb7a2 Compare June 7, 2024 22:23

jerryzh168 requested a review from msaroufim June 7, 2024 23:46

msaroufim approved these changes Jun 9, 2024

View reviewed changes

msaroufim merged commit 8c8bc81 into pytorch:main Jun 9, 2024
13 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Move quant ops to utils.py #331

Move quant ops to utils.py #331

jerryzh168 commented Jun 6, 2024 •

edited

Loading

pytorch-bot bot commented Jun 6, 2024 •

edited

Loading

msaroufim Jun 7, 2024

jerryzh168 Jun 7, 2024

Move quant ops to utils.py #331

Move quant ops to utils.py #331

Conversation

jerryzh168 commented Jun 6, 2024 • edited Loading

pytorch-bot bot commented Jun 6, 2024 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/331

❗ 1 Active SEVs

✅ No Failures

msaroufim Jun 7, 2024

Choose a reason for hiding this comment

jerryzh168 Jun 7, 2024

Choose a reason for hiding this comment

jerryzh168 commented Jun 6, 2024 •

edited

Loading

pytorch-bot bot commented Jun 6, 2024 •

edited

Loading