fix: Fix FMHA-based MLA in the generation phase and add MLA unit test #3863
Signed-off-by: Perkz Zheng <[email protected]>
Signed-off-by: peaceh <[email protected]>
Signed-off-by: Dylan Chen <[email protected]>
Signed-off-by: Jinyang Yuan <[email protected]>
@PerkzZheng fixes a bug in FMHA-based MLA in the generation phase (the cubins were generated from commit 56491367f0a36a4f7bba68d79b4c04f369dd5a91).
@peaceh-nv modifies the code for SM120.
@jinyangyuan-nvidia adds an MLA unit test.
This PR also cherry-picks the changes from #3675.