[Bugfix] Fix KV cache sizing and allocation for hybrid Mamba/attention models#37429
Open
swtb3 wants to merge 4 commits intovllm-project:mainfrom
Open
[Bugfix] Fix KV cache sizing and allocation for hybrid Mamba/attention models#37429swtb3 wants to merge 4 commits intovllm-project:mainfrom
swtb3 wants to merge 4 commits intovllm-project:mainfrom