Skip to content

[WIP][Bugfix] Fix CUDA OOM in sparse_attn_indexer prefill with high concurrency#35488

Closed
haosdent wants to merge 1 commit intovllm-project:mainfrom
haosdent:fix-34553
Closed

[WIP][Bugfix] Fix CUDA OOM in sparse_attn_indexer prefill with high concurrency#35488
haosdent wants to merge 1 commit intovllm-project:mainfrom
haosdent:fix-34553

Commits

Commits on Mar 18, 2026