
2:4 Sparse GEMM API #1728

Open
tsengalb99 opened this issue Feb 18, 2025 · 3 comments

@tsengalb99

I have a setup with dense matrices $A, B$ and a 2:4 sparsity mask $M$. Is there an API in torchao where I can perform $A (B \odot M)^T$ and get the speedups from 2:4 GEMMs? That is, instead of having a pre-sparsified matrix $B$, I want to apply $M$ to $B$ online and then do the sparse GEMM.
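For concreteness, the dense reference for the computation being asked about can be sketched in plain NumPy. The `make_24_mask` helper here is illustrative only (not a torchao API), and assumes 2:4 groups run along the last axis of $B$:

```python
import numpy as np

def make_24_mask(B):
    """Build a 2:4 mask: in every contiguous group of 4 entries along the
    last axis, keep the 2 largest-magnitude entries and zero the rest."""
    rows, cols = B.shape
    assert cols % 4 == 0
    groups = np.abs(B).reshape(rows, cols // 4, 4)
    # argsort is ascending, so the last two indices are the two largest.
    top2 = np.argsort(groups, axis=-1)[..., 2:]
    mask = np.zeros_like(groups, dtype=bool)
    np.put_along_axis(mask, top2, True, axis=-1)
    return mask.reshape(rows, cols)

rng = np.random.default_rng(0)
A = rng.standard_normal((8, 16))
B = rng.standard_normal((8, 16))
M = make_24_mask(B)

# Dense reference for A (B ⊙ M)^T that a 2:4 sparse GEMM should reproduce.
y_ref = A @ (B * M).T
```

A 2:4-aware kernel would skip the zeroed half of `B * M` instead of materializing it densely.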

@jcaip
Contributor

jcaip commented Feb 18, 2025

Hi @tsengalb99

We don't have a public API for this, but you may be able to hack something together with `semi_structured_sparsify_like`.

See https://github.com/pytorch/pytorch/blob/c9a15d980f249ad3697822476f658946d7907b44/test/test_sparse_semi_structured.py#L755 for an example of how to use the private API `torch._sparse_semi_structured_apply`.
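Semantically, "sparsify like" means taking the 2:4 nonzero layout of one tensor and imposing it on another. Since `torch._sparse_semi_structured_apply` is private and needs CUDA sparse tensor cores, here is a NumPy sketch of just the semantics; `sparsify_like` is a hypothetical helper name, and the real torch API additionally packs the result into the compressed form the sparse GEMM kernels consume:

```python
import numpy as np

def sparsify_like(dense, pattern):
    """Zero out `dense` wherever `pattern` is zero, so `dense` inherits
    the 2:4 layout of `pattern` (semantic sketch only)."""
    return dense * (pattern != 0)

rng = np.random.default_rng(1)
# Make `pattern` 2:4 sparse by zeroing 2 of every 4 along the last axis.
pattern = rng.standard_normal((4, 8))
pattern.reshape(4, 2, 4)[..., :2] = 0.0

B = rng.standard_normal((4, 8))
B_sparse = sparsify_like(B, pattern)
```

`B_sparse` now has exactly the nonzero positions of `pattern`, which is the "apply a mask online, then do the sparse GEMM" step from the question.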

@tsengalb99
Author

tsengalb99 commented Feb 19, 2025

I tried doing the following:

```python
sm = torch.sparse.to_sparse_semi_structured(B * M)
y = torch.mm(sm, A.T).T
```

This sometimes works, but other times I get

```
NotImplementedError: `SparseSemiStructuredTensorCUSPARSELT` matmul: operation is not supported
```

raised from

```
[rank7]:   File "/home/alberttseng/miniconda3/lib/python3.12/site-packages/torch/sparse/_semi_structured_ops.py", line 122, in semi_sparse_mm
[rank7]:     res = A._mm(B_padded)
```

Do you know how to fix this?

@jcaip
Contributor

jcaip commented Feb 20, 2025

Do you have a script to repro @tsengalb99? I wouldn't expect a transient error here; I wonder what it could be.

In any case, I doubt your approach would be faster except on very large matrices. To be faster, I think you'd have to do something like what's outlined here. Note that this approach doesn't use `to_sparse_semi_structured`; it uses `torch._sparse_semi_structured_apply` instead.
