[Attention] Support distinguishing between short extends and decodes #37303

Merged
LucasWilkinson merged 4 commits into vllm-project:main from neuralmagic:nemotron-h-mtp-4way-batch-split
Mar 20, 2026

Commits

Commits on Mar 18, 2026