[Float8] Make Inference and Training code independent #808
Conversation
Note:
    If the input tensor is already in Float8 format, it is returned as is without re-casting.

class Float8MMConfig(NamedTuple):
Can we make the names ScaledMMConfig and Float8MMConfig less confusing? Ideally it should be clear why there are two objects and in which context the user should use which one.
Sure. I was going to go with ScaledMMConfigInference, but that is too verbose even for me. Any suggestions?
For training I went with a user-facing Float8GemmConfig, which is a dataclass and easy to explain (IMO). The way the data gets passed around is not user facing, so the details of ScaledMMConfig don't matter as much.
TBH, making ScaledMMConfig public and reusable between training and inference sounds right to me, if you really want this to be public without a dataclass wrapper.
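For context, a minimal sketch of the split being described; the field names below are illustrative assumptions, not the exact fields in torchao:

```python
from dataclasses import dataclass
from typing import NamedTuple


@dataclass(frozen=True)
class Float8GemmConfig:
    """User-facing training config: small, documented, easy to explain."""
    use_fast_accum: bool = False


class ScaledMMConfig(NamedTuple):
    """Internal plumbing passed alongside tensors; not part of the public API."""
    emulate: bool = False
    use_fast_accum: bool = False
    pad_inner_dim: bool = False
```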
It cannot be a dataclass today because that won't work with compile. Lazos has a PR to make frozen dataclasses proxyable, but until then it has to be a NamedTuple.
I love dataclasses, but I don't understand why they are more understandable than a namedtuple. Is the problem that this name is similar to other names?
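A rough sketch of the constraint being described, assuming the config has to flow through torch.compile; the dataclass variant is what it could look like once frozen dataclasses become proxyable:

```python
from dataclasses import dataclass
from typing import NamedTuple

import torch


# Works with torch.compile today: NamedTuples can be traced through dynamo.
class ScaledMMConfig(NamedTuple):
    use_fast_accum: bool = False


# The dataclass equivalent (per the comment above, not usable until frozen
# dataclasses are proxyable).
@dataclass(frozen=True)
class ScaledMMConfigDataclass:
    use_fast_accum: bool = False


def scaled_matmul(a: torch.Tensor, b: torch.Tensor, config: ScaledMMConfig) -> torch.Tensor:
    # Placeholder body: the config rides along with the tensors into the
    # compiled region, which is why it must be a type dynamo can handle.
    return a @ b


compiled = torch.compile(scaled_matmul)
out = compiled(torch.randn(4, 4), torch.randn(4, 4), ScaledMMConfig(use_fast_accum=True))
```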
I think it's less about dataclass versus NamedTuple, and more about field readability / understandability / future-proofness, etc. For example, if we made ScaledMMConfig have an output dtype enum instead of a boolean and ensured all the other args are consistent with other public APIs, I think that would be fine.
https://stackoverflow.com/a/18348004/1058521 is one minor reason: default values look nicer with dataclasses.
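As a rough illustration of the enum suggestion (the names here are hypothetical, not an existing torchao API):

```python
from dataclasses import dataclass
from enum import Enum

import torch


class Float8OutputDtype(Enum):
    """Explicit output dtype instead of a hard-to-read boolean flag."""
    FLOAT8_E4M3 = torch.float8_e4m3fn
    HIGH_PRECISION = torch.bfloat16


@dataclass(frozen=True)
class ScaledMMConfig:
    # Reads better than `fp8_output: bool = False`, and defaults are easy to spot.
    output_dtype: Float8OutputDtype = Float8OutputDtype.HIGH_PRECISION
    use_fast_accum: bool = False
```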
stack-info: PR: #808, branch: drisspg/stack/9
Stacked PRs:
Summary
Removes the old FP8 inference flow and switches over to the newly added support in AQT + the quantize_ APIs. It also completely decouples inference from the training code by creating a new Float8MMConfig and an addmm wrapper in the inference.py file.
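A minimal sketch of what such an addmm wrapper could look like; this is an illustration, not the PR's actual implementation, and it assumes the torch._scaled_mm keyword arguments available in recent PyTorch releases (the exact signature varies across versions):

```python
from typing import NamedTuple, Optional

import torch


class Float8MMConfig(NamedTuple):
    """Inference-side matmul config (fields shown here are assumptions)."""
    use_fast_accum: bool = False


def float8_addmm(
    input_fp8: torch.Tensor,       # float8 activation
    weight_fp8: torch.Tensor,      # float8 weight, already laid out for the mm
    input_scale: torch.Tensor,
    weight_scale: torch.Tensor,
    bias: Optional[torch.Tensor] = None,
    out_dtype: torch.dtype = torch.bfloat16,
    config: Float8MMConfig = Float8MMConfig(),
) -> torch.Tensor:
    # torch._scaled_mm performs the fp8 x fp8 matmul and applies the scales.
    return torch._scaled_mm(
        input_fp8,
        weight_fp8,
        scale_a=input_scale,
        scale_b=weight_scale,
        bias=bias,
        out_dtype=out_dtype,
        use_fast_accum=config.use_fast_accum,
    )
```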