[TurboQuant] Reduce TurboQuant KV memory loss by deduplicating decode scratch buffers #40706

Open
lesj0610 wants to merge 2 commits into vllm-project:main from lesj0610:lesj/tq-decode-workspace-dedup

Commits

Commits on May 4, 2026

Commits on May 8, 2026