[Cutlass] Add group gemm kernels #16751

vinx13 · 2024-03-20T00:25:50Z

This adds sm90a group gemm kernels from cutlass. It includes the cmake improvements from #16638.

Supersedes #16638.

Co-authored-by: Chris Sullivan [email protected]
Co-authored-by: masahi [email protected]

* [CMAKE][CUTLASS] Improve dependency management with different cutlass versions.
* Each cutlass-based submodule library now uses its own cutlass submodule dependency
* TVM's cutlass submodule is decoupled from others and is bumped to v3.4.1 for H100 support
* Add scaffold for new cutlass fp8 dequant gemm interface targeting TVM's cutlass submodule
* Remove handling for moe_gemm.cc and flash_decoding.cu which are no longer used upstream.
* Add cutlass fp8 group gemm
* Add fp16 grouped gemm support for sm90
* [Cutlass] Support alpha scaling in fp8 group gemm
* [Cutlass] Support device alpha_ptr for fp8 group gemm

---------

Co-authored-by: Chris Sullivan <[email protected]>
Co-authored-by: masahi <[email protected]>
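For readers unfamiliar with the term, a group (grouped) gemm multiplies several row blocks of one input by different weight matrices in a single call, optionally scaling the result by a factor alpha. The pure-Python reference below is only a sketch of that general idea. The function name `group_gemm_reference`, the argument layout (`group_sizes` as a list of row counts, weights stored as K x N), and the place where `alpha` is applied are assumptions for illustration, not the interface or memory layout of the kernels added in this PR.

```python
def group_gemm_reference(x, weights, group_sizes, alpha=1.0):
    """Illustrative grouped GEMM on nested lists (hypothetical helper).

    x           : list of M rows, each of length K
    weights     : one K x N matrix per group
    group_sizes : number of consecutive rows of x belonging to each group
    alpha       : scalar applied to every output element
    """
    assert len(weights) == len(group_sizes)
    assert sum(group_sizes) == len(x)

    out = []
    row = 0
    for w, size in zip(weights, group_sizes):
        k, n = len(w), len(w[0])
        # Rows row .. row + size - 1 are multiplied by this group's weight.
        for r in x[row:row + size]:
            assert len(r) == k
            out.append(
                [alpha * sum(r[i] * w[i][j] for i in range(k)) for j in range(n)]
            )
        row += size
    return out


x = [[1, 2], [3, 4], [5, 6]]
weights = [
    [[1, 0], [0, 1]],  # group 0: identity
    [[2, 0], [0, 2]],  # group 1: doubles its rows
]
result = group_gemm_reference(x, weights, group_sizes=[1, 2])
scaled = group_gemm_reference(x, weights, group_sizes=[1, 2], alpha=0.5)
```

Here the first row of `x` goes through the identity weight and the remaining two rows go through the doubling weight; `scaled` shows the same computation with every output multiplied by `alpha`.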

csullivan and others added 9 commits March 19, 2024 21:46

Remove handling for moe_gemm.cc and flash_decoding.cu which are no longer used upstream.

05c9ba6

Add cutlass fp8 group gemm

1ba99be

Add fp16 grouped gemm support for sm90

d91511a

[Cutlass] Support alpha scaling in fp8 group gemm

1c2ea79

[Cutlass] Support device alpha_ptr for fp8 group gemm

ca21a0b

fix

917652c

add test

3a41d44

Remove unused file and update cmake arch check

1c22b49

github-actions bot requested review from csullivan, masahi and tqchen March 20, 2024 00:26

fix cmake

8c6c6ea

tqchen approved these changes Mar 20, 2024

tqchen merged commit 89e9028 into apache:main Mar 20, 2024

ysh329 mentioned this pull request Apr 21, 2024

[Release] v0.16.0 Release Candidate Notes #16911

Closed
