avoid .to() followed by inplace mutation to appease export #1387

bdhirsh · 2024-12-06T15:10:10Z

Export currently has a restriction, where inplace mutations on the output of a call to aten.to() results in an error.

We should really lift this restriction, but in the meantime, this PR avoids hitting that problem in the quantization logic.

I tested by adding the following lines to the bottom of https://github.com/hustvl/ViTMatte/blob/main/modeling%2Fbackbone%2Fvit.py and confirming that export did not error:

quantize_(m, int8_dynamic_activation_int8_weight())
m = unwrap_tensor_subclass(m)

x = torch.randn(4, 3, 128, 128)
m = torch.export.export(m, (x,))
m2 = m.run_decompositions()
print(m2)

Stack from ghstack (oldest at bottom):

-> avoid .to() followed by inplace mutation to appease export #1387

[ghstack-poisoned]

pytorch-bot · 2024-12-06T15:10:14Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/1387

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit 08bbd50 with merge base 8a805d0 ():

BROKEN TRUNK - The following job failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

Run Regression Tests / test-nightly (CUDA Nightly, linux.g5.12xlarge.nvidia.gpu, --pre torch --index-url https://downloa... / linux-job (gh) (trunk failure)
test/integration/test_integration.py::TestSubclass::test_int8_dynamic_quant_subclass_api_5_cuda

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ghstack-source-id: b979b7637c8d062b5f5a61c56e1e78edf1b70290 Pull Request resolved: #1387

bdhirsh · 2024-12-06T15:11:59Z

fyi @tugsbayasgalan @yushangdi

bhack · 2024-12-06T22:03:51Z

/opt/conda/lib/python3.11/site-packages/torchao/dtypes/uintx/block_sparse_layout.py:        y += bias
/opt/conda/lib/python3.11/site-packages/torchao/dtypes/uintx/plain_layout.py:        y += bias
/opt/conda/lib/python3.11/site-packages/torchao/dtypes/uintx/semi_sparse_layout.py:        y += bias
/opt/conda/lib/python3.11/site-packages/torchao/dtypes/uintx/tensor_core_tiled_layout.py:        y += bias
/opt/conda/lib/python3.11/site-packages/torchao/dtypes/uintx/uint4_layout.py:                y += bias
/opt/conda/lib/python3.11/site-packages/torchao/prototype/quantization/autoquant_v2.py:            y += bias
/opt/conda/lib/python3.11/site-packages/torchao/prototype/quantization/autoquant_v2.py:            y += bias
/opt/conda/lib/python3.11/site-packages/torchao/quantization/autoquant.py:            y += bias
/opt/conda/lib/python3.11/site-packages/torchao/quantization/autoquant.py:            y += bias
/opt/conda/lib/python3.11/site-packages/torchao/quantization/subclass.py:            y += bias
/opt/conda/lib/python3.11/site-packages/torchao/quantization/subclass.py:            y += bias
/opt/conda/lib/python3.11/site-packages/torchao/quantization/weight_only.py:            y += self.bias
```

jerryzh168 · 2024-12-07T02:22:48Z

thanks, we can't use ghstack in torchao btw, you can use https://github.com/modularml/stack-pr or just normal git push

jerryzh168 · 2024-12-07T02:23:22Z

I'm not sure if changing this would have perf implications actually, probably need to benchmark a bit for the popular code paths

bhack · 2024-12-07T02:24:38Z

I think it is better to have export coverage in the CI if we can.

bhack · 2024-12-07T04:13:51Z

Related pytorch/pytorch#138606

bhack · 2024-12-07T10:59:30Z

/cc @yushangdi

Wording of error message to include AOTI package

yushangdi · 2024-12-09T18:32:07Z

Related pytorch/pytorch#138606

fyi, @jerryzh168 @bhack , the approach in the PR linked above doesn't work. I'm trying out a new approach, mostly this:

if we have a.to(b), and b has a different dtype with a, then it must be a copy. In this case, we do not need to freeze the tensor. Instead, we use torch.ops.aten._assert_tensor_metadata.default to ensure that a must not have the same dtype as b.

cc @tugsbayasgalan

avoid .to() followed by inplace mutation to appease export

08bbd50

[ghstack-poisoned]

bdhirsh added a commit that referenced this pull request Dec 6, 2024

avoid .to() followed by inplace mutation to appease export

e57993d

ghstack-source-id: b979b7637c8d062b5f5a61c56e1e78edf1b70290 Pull Request resolved: #1387

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Dec 6, 2024

bdhirsh requested a review from jerryzh168 December 6, 2024 15:11

bdhirsh mentioned this pull request Dec 6, 2024

unhashable type: non-nested SymInt #1381

Open

bdhirsh added the topic: bug fix Use this tag for PRs that fix bugs label Dec 6, 2024

yushangdi approved these changes Dec 6, 2024

View reviewed changes

yanbing-j pushed a commit to yanbing-j/ao that referenced this pull request Dec 9, 2024

Update builder.py (pytorch#1387)

826c0c6

Wording of error message to include AOTI package

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

avoid .to() followed by inplace mutation to appease export #1387

avoid .to() followed by inplace mutation to appease export #1387

bdhirsh commented Dec 6, 2024 •

edited

Loading

pytorch-bot bot commented Dec 6, 2024 •

edited

Loading

bdhirsh commented Dec 6, 2024

bhack commented Dec 6, 2024 •

edited

Loading

jerryzh168 commented Dec 7, 2024

jerryzh168 commented Dec 7, 2024

bhack commented Dec 7, 2024 •

edited

Loading

bhack commented Dec 7, 2024

bhack commented Dec 7, 2024

yushangdi commented Dec 9, 2024 •

edited

Loading

avoid .to() followed by inplace mutation to appease export #1387

Are you sure you want to change the base?

avoid .to() followed by inplace mutation to appease export #1387

Conversation

bdhirsh commented Dec 6, 2024 • edited Loading

pytorch-bot bot commented Dec 6, 2024 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/1387

✅ You can merge normally! (1 Unrelated Failure)

bdhirsh commented Dec 6, 2024

bhack commented Dec 6, 2024 • edited Loading

jerryzh168 commented Dec 7, 2024

jerryzh168 commented Dec 7, 2024

bhack commented Dec 7, 2024 • edited Loading

bhack commented Dec 7, 2024

bhack commented Dec 7, 2024

yushangdi commented Dec 9, 2024 • edited Loading

bdhirsh commented Dec 6, 2024 •

edited

Loading

pytorch-bot bot commented Dec 6, 2024 •

edited

Loading

bhack commented Dec 6, 2024 •

edited

Loading

bhack commented Dec 7, 2024 •

edited

Loading

yushangdi commented Dec 9, 2024 •

edited

Loading