[Bugfix] Fix KV cache sizing and allocation for hybrid Mamba/attention models #37429

Open
swtb3 wants to merge 4 commits into vllm-project:main from swtb3:fix/hybrid-mamba-compact-allocation

Commits

Commits on Mar 18, 2026

Commits on Mar 19, 2026