Int4 sparse marlin tensor #2771
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2771
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures
As of commit 7de0b12 with merge base 2eae09b.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
torchao/quantization/quantize_/workflows/int4/int4_marlin_sparse_tensor.py
test/quantization/quantize_/workflows/int4/test_int4_marlin_sparse_tensor.py
425e72d to 9f2ae7c
looks good, thanks!
* added marlin sparse to packing format, initial commit
* deleting unnecessary functions
* packing
* linear
* add call to from_hp
* unit test
* fix test_linear
* formatting
* remove comments
* update VERSION to version
* fix module path unit test
* adding sizes to linear unit test
* move pre_process and from_plain to from_hp
* compile test
No description provided.