-
Notifications
You must be signed in to change notification settings - Fork 255
moe mxfp4 block_m = 64/128 #1266
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
27 commits
Select commit
Hold shift + click to select a range
8014e68
moe mxfp4 block_m = 64/128
xudoyuan 52c55aa
update a4w4_gemm2_kernels_list
xudoyuan 88a72c2
add instance tile_m=32
lalala-sh 049a8d2
tuned configuration
zhiding512 125d488
Update test_moe_2stage.py
lalala-sh 8793a35
refactor
xudoyuan b65e405
update v1 pipeline
lalala-sh b82fe1f
update badcase
lalala-sh 4b2594a
fix fp4 moe tuner
lalala-sh e485e7f
reformat
lalala-sh c07d2e4
Merge remote-tracking branch 'origin/main' into moe_mxfp4_ck_64_128
lalala-sh f0b7911
revert ck update
lalala-sh 55e0b33
update ck
lalala-sh c1da914
Merge branch 'main' into moe_mxfp4_ck_64_128
lalala-sh 685fd10
Moe mxfp4 ck preshf bns (#1312)
xudoyuan a62e3db
add AITER_MXFP4_MOE_SF switch for mxfp4 moe
lalala-sh 0b712a2
v3 n128
zhiding512 9fba0c5
32x32 v1
zhiding512 f91773a
resolve ck conflict
xudoyuan 9458aa8
Merge branch 'main' into moe_mxfp4_ck_64_128
xudoyuan f9c9097
rm use_int4=True
xudoyuan 9fe0d84
reformatted op_tests/test_moe_2stage.py
xudoyuan e4fbdbe
AITER_MXFP4_MOE_SF bugfix
zhiding512 73128e7
Merge branch 'main' into moe_mxfp4_ck_64_128
xudoyuan 9eea40a
revert torch.int4
xudoyuan 27801bf
Merge branch 'main' into moe_mxfp4_ck_64_128
xudoyuan b3fe899
Merge branch 'main' into moe_mxfp4_ck_64_128
coderfeli File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.