-
Notifications
You must be signed in to change notification settings - Fork 3.4k
[1/2] Add Kernel support for Cutlass based Fused FP4 MoE #6093
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from 14 commits
Commits
Show all changes
15 commits
Select commit
Hold shift + click to select a range
7b39b5b
Initial FP4 MoE commit
pavanimajety ed4813a
remove get_sm_version
pavanimajety dafbd4e
Running changes
pavanimajety 7d5f998
add test and other necessary changes in CMakeLists.txt
pavanimajety ef8cd98
fix pre-commit
pavanimajety 3d47d32
Combine blockscale offsets to the prepare function to reduce overhead
pavanimajety 3c96bd4
more formatting changes and change flaky test
pavanimajety 2d79cfd
fix some warnings
pavanimajety 1e4cc24
apply format diff and fix constexpr warning
pavanimajety 42819c6
lint again
pavanimajety d306b5e
Add ab_strides and c_strides to the kernel and add shuffle_rows kernel
pavanimajety 29679bc
Fix shuffle_rows issue and lint
pavanimajety 0e717b0
Fix lint again
pavanimajety b89b9f0
Merge branch 'main' into cutlass_fp4_moe
pavanimajety c9ecc6e
Merge branch 'main' into cutlass_fp4_moe
zhyncs File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.