Skip to content

[Revision] Add fast decode plan for flashinfer mla #4012

Merged
zhyncs merged 3 commits intosgl-project:mainfrom
Fridge003:deepseek
Mar 5, 2025
Merged

[Revision] Add fast decode plan for flashinfer mla #4012
zhyncs merged 3 commits intosgl-project:mainfrom
Fridge003:deepseek

Conversation

@Fridge003
Copy link
Collaborator

Motivation

Revision of #3987

@merrymercy @zhyncs @Ying1123

Modifications

Checklist

Loading
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants