[DRAFT] use cutlass for 24 #33

Draft
wants to merge 53 commits into base: main

Commits on Oct 22, 2024

  1. 5d51361

Commits on Oct 28, 2024

  1. Update with test code

    Faraz9877 committed Oct 28, 2024
    17f5b96

Commits on Oct 30, 2024

  1. 471a03c
  2. 0b332fb

Commits on Nov 5, 2024

  1. da31648
  2. Fix A100 int8 tests

    ilmarkov committed Nov 5, 2024
    e655f94
  3. Add fp8 cusparseLt

    ilmarkov committed Nov 5, 2024
    5fc3c1c
  4. wip

    ilmarkov committed Nov 5, 2024
    9cf36d6
  5. Fix signatures

    ilmarkov committed Nov 5, 2024
    ad09e79
  6. Fix compilation and tests

    ilmarkov committed Nov 5, 2024
    e75eabc
  7. Update for older platforms

    ilmarkov committed Nov 5, 2024
    0306390
  8. Add benchmarks

    ilmarkov committed Nov 5, 2024
    1021acb
  9. Fix typo

    ilmarkov committed Nov 5, 2024
    19ce358
  10. Added scaled_mm for fp8.

    Removed cmake check for cusparseLt, needs to be reverted when the cmake issue is resolved.
    ilmarkov committed Nov 5, 2024
    959408c
  11. Add docstrings

    ilmarkov committed Nov 5, 2024
    117b87b
  12. Update for torch 2.5

    ilmarkov committed Nov 5, 2024
    2c7e68e
  13. 922f4f8
  14. Add fp8 cusparseLt

    ilmarkov committed Nov 5, 2024
    beca038
  15. Fix compilation and tests

    ilmarkov committed Nov 5, 2024
    5d9cd25
  16. 39ad9d4
  17. Cached cusparseLt

    ilmarkov committed Nov 5, 2024
    520eb62
  18. Fix destroy function

    ilmarkov committed Nov 5, 2024
    20956e6
  19. Prepare for reproduce

    ilmarkov committed Nov 5, 2024
    87c8088
  20. Fix cusparseLt caching

    ilmarkov committed Nov 5, 2024
    4ea58b1
  21. f0551ef

Commits on Nov 6, 2024

  1. d7476e8
  2. add sparse 2:4 weight loading support

    dsikka authored and rahul-tuli committed Nov 6, 2024
    681ea5e
  3. Some more changes!

    rahul-tuli committed Nov 6, 2024
    ecf878f
  4. Cleanup

    rahul-tuli committed Nov 6, 2024
    80952dc
  5. get uncompressed to work; update gemm to use contiguous; use alex's utils instead of our decompressor

    dsikka authored and rahul-tuli committed Nov 6, 2024
    8462c9d
  6. patch

    dsikka authored and rahul-tuli committed Nov 6, 2024
    0a3e506
  7. use our decompressor

    dsikka authored and rahul-tuli committed Nov 6, 2024
    2e28972
  8. Some more work

    rahul-tuli committed Nov 6, 2024
    28f0abb

Commits on Nov 7, 2024

  1. Use new scaled_T function

    rahul-tuli committed Nov 7, 2024
    c7a97a8

Commits on Nov 8, 2024

  1. ccadad0
  2. Add multi-GPU

    Faraz9877 committed Nov 8, 2024
    807737c

Commits on Nov 14, 2024

  1. 04c19a5
  2. Clean up

    Faraz9877 committed Nov 14, 2024
    2a85c5a
  3. Update code

    Faraz9877 committed Nov 14, 2024
    1b381c9
  4. Update code

    Faraz9877 committed Nov 14, 2024
    4e31076
  5. Clean up the benchmarking

    Faraz9877 committed Nov 14, 2024
    13fccf4
  6. b345cc8
  7. Fix cmake errors

    Faraz9877 committed Nov 14, 2024
    2d03e1d
  8. Fix the cmake TAG

    Faraz9877 committed Nov 14, 2024
    e9439cc

Commits on Nov 15, 2024

  1. 4ba7c0f
  2. update

    robertgshaw2-neuralmagic committed Nov 15, 2024
    f74ef37
  3. fixed

    robertgshaw2-neuralmagic committed Nov 15, 2024
    f5bc9eb
  4. updated

    robertgshaw2-neuralmagic committed Nov 15, 2024
    1316076
  5. updated

    robertgshaw2-neuralmagic committed Nov 15, 2024
    4d2b12c
  6. fe30b53
  7. 4c61b19
  8. updated

    robertgshaw2-neuralmagic committed Nov 15, 2024
    86716f8
  9. Some cleanup

    rahul-tuli committed Nov 15, 2024
    c796ac8