Skip to content

Apply heuristic for DeepSeek MLA prefill splitting#126

Merged
tlrmchlsmth merged 1 commit intovllm-project:mainfrom
MatthewBonanni:split_heuristic
Mar 11, 2026
Merged

Apply heuristic for DeepSeek MLA prefill splitting#126
tlrmchlsmth merged 1 commit intovllm-project:mainfrom
MatthewBonanni:split_heuristic

Conversation

@MatthewBonanni
Copy link

@MatthewBonanni MatthewBonanni commented Mar 11, 2026

This was intended to be part of #123 (see comment). Improves performance compared to always adjusting tile size

559494626-b96cdd34-9c9a-4a27-95ee-73f437e5d80d

Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
@tlrmchlsmth tlrmchlsmth merged commit 1488682 into vllm-project:main Mar 11, 2026
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants