
Conversation

@jinyangyuan-nvidia
Collaborator

@jinyangyuan-nvidia jinyangyuan-nvidia commented Apr 25, 2025

@PerkzZheng fixes a bug in FMHA-based MLA in the generation phase (the cubins were generated from commit 56491367f0a36a4f7bba68d79b4c04f369dd5a91).
@peaceh-nv modifies the code for SM120.
@jinyangyuan-nvidia adds an MLA unit test.
This PR also cherry-picks the modifications in #3675.

@jinyangyuan-nvidia
Collaborator Author

/bot run

@tensorrt-cicd
Collaborator

PR_Github #3382 [ run ] triggered by Bot

@tensorrt-cicd
Collaborator

PR_Github #3382 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #2367 completed with status: 'FAILURE'

@jinyangyuan-nvidia jinyangyuan-nvidia force-pushed the dev/chore_mla_ut branch 2 times, most recently from 8befdc7 to bd144b7 Compare April 25, 2025 14:46
@jinyangyuan-nvidia
Collaborator Author

/bot run

@tensorrt-cicd
Collaborator

PR_Github #3399 [ run ] triggered by Bot

@tensorrt-cicd
Collaborator

PR_Github #3399 [ run ] completed with state FAILURE
/LLM/main/L0_MergeRequest_PR pipeline #2383 completed with status: 'FAILURE'

@jinyangyuan-nvidia
Collaborator Author

/bot run

@tensorrt-cicd
Collaborator

PR_Github #3510 [ run ] triggered by Bot

@tensorrt-cicd
Collaborator

PR_Github #3510 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #2472 completed with status: 'FAILURE'

@jinyangyuan-nvidia
Collaborator Author

/bot run

@tensorrt-cicd
Collaborator

PR_Github #3532 [ run ] triggered by Bot

@jinyangyuan-nvidia
Collaborator Author

/bot kill

@jinyangyuan-nvidia
Collaborator Author

/bot run

@tensorrt-cicd
Collaborator

PR_Github #3548 [ kill ] triggered by Bot

@tensorrt-cicd
Collaborator

PR_Github #3549 [ run ] triggered by Bot

@tensorrt-cicd
Collaborator

PR_Github #3548 [ kill ] completed with state ABORTED

@tensorrt-cicd
Collaborator

PR_Github #3532 [ run ] completed with state ABORTED

@jinyangyuan-nvidia
Collaborator Author

/bot kill

@tensorrt-cicd
Collaborator

PR_Github #3560 [ kill ] triggered by Bot

@tensorrt-cicd
Collaborator

PR_Github #3549 [ run ] completed with state ABORTED

@tensorrt-cicd
Collaborator

PR_Github #3560 [ kill ] completed with state SUCCESS
Successfully killed previous jobs for commit 34c549d

@jinyangyuan-nvidia
Collaborator Author

/bot run

@tensorrt-cicd
Collaborator

PR_Github #3600 [ run ] triggered by Bot

@tensorrt-cicd
Collaborator

PR_Github #3600 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #2539 completed with status: 'SUCCESS'

@jinyangyuan-nvidia jinyangyuan-nvidia enabled auto-merge (squash) April 28, 2025 15:36
PerkzZheng and others added 7 commits April 29, 2025 09:06
Signed-off-by: Perkz Zheng <[email protected]>
Signed-off-by: peaceh <[email protected]>
Signed-off-by: Dylan Chen <[email protected]>
Signed-off-by: Dylan Chen <[email protected]>
Signed-off-by: Jinyang Yuan <[email protected]>
Signed-off-by: Jinyang Yuan <[email protected]>
@niukuo niukuo merged commit dafc28f into NVIDIA:main Apr 29, 2025
1 check passed
@jinyangyuan-nvidia jinyangyuan-nvidia deleted the dev/chore_mla_ut branch April 29, 2025 01:27


7 participants