Status of prototype features #1807

msaroufim · 2025-03-01T19:32:55Z

I was parsing through our prototype folder and wanted to give my take on what should be promoted, deleted or requires further discussion

I'd love to hear more on folks especially if you disagree with anything!

cc @supriyar @jerryzh168 @drisspg @vkuzo @gau-nernst

The text was updated successfully, but these errors were encountered:

vkuzo · 2025-03-02T15:16:45Z

float8nocompile: This doesn't feel like it should be a prototype feature

I don't think this should be a separate feature. If we decide to polish this, IMO it should be a setting in the regular float8/int8/mx flows to use fused eager mode kernels. I'd want to make sure overall UX does not regress with respect to activation checkpointing and that performance is actually compelling on real models important today (large enough gemms) before shipping this.

danielvegamyhre · 2025-03-02T16:13:08Z

float8nocompile: This doesn't feel like it should be a prototype feature and I'd like to hear some detail on the promotion plan @danielvegamyhre

IMO the main blocker to promoting this from prototype is better composability with AC, for which we need to implement the feature request here pytorch/pytorch#144928 From my conversations with Jeffrey my understanding is he agrees it would be a useful feature, has some ideas in mind about how to implement it, and is planning to do it some time this half (cc @soulitzer please correct me if I'm mistaken about this).

If we decide to polish this, IMO it should be a setting in the regular float8/int8/mx flows to use fused eager mode kernels.

This is an interesting idea as well, I'd be interested in exploring that once the AC API described in the feature request has landed.

I'd want to make sure overall UX does not regress with respect to activation checkpointing and that performance is actually compelling on real models important today (large enough gemms) before shipping this.

+1

supriyar · 2025-03-03T18:49:00Z

Agree with you on the things we should deprecate/delete. We can perhaps do them during our next BE day/week? cc @andrewor14

For the rest like quantization algorithms or sparsity I'll defer to @jerryzh168 and @jcaip to share their thoughts.

jcaip · 2025-03-04T18:20:57Z

cc @msaroufim

For sparsity: 2:4, marlin, and BSR all have been promoted out of prototype, the only things that remain are:

the old structured pruner / sparsifier which is used for masking - I am in favor of deleting this, as my general sense is that people are doing their own masking. But we'll need to update this tutorial that currently use this API.
some superblock eval / train code (the actual implementation is in torchao.sparsity) - I would like to delete this half as 90% of this is shared with the reference torchvision implementation.

supriyar mentioned this issue Mar 3, 2025

Dora broken and tests don't run on CI #1800

Closed

msaroufim added tracker rfc topic: deprecation Use this tag if this PR deprecates a feature labels Mar 3, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Status of prototype features #1807

Status of prototype features #1807

msaroufim commented Mar 1, 2025 •

edited

Loading

vkuzo commented Mar 2, 2025

danielvegamyhre commented Mar 2, 2025 •

edited

Loading

supriyar commented Mar 3, 2025

jcaip commented Mar 4, 2025

Status of prototype features #1807

Status of prototype features #1807

Comments

msaroufim commented Mar 1, 2025 • edited Loading

vkuzo commented Mar 2, 2025

danielvegamyhre commented Mar 2, 2025 • edited Loading

supriyar commented Mar 3, 2025

jcaip commented Mar 4, 2025

msaroufim commented Mar 1, 2025 •

edited

Loading

danielvegamyhre commented Mar 2, 2025 •

edited

Loading