[Attention Backend] TurboQuant: 2-bit KV cache compression with 4x capacity #38479

Merged
vllm-bot merged 10 commits into vllm-project:main from vibhavagarwal5:feature/turboquant-kv-cache
Apr 15, 2026

Commits

Commits on Apr 11, 2026

Commits on Apr 12, 2026

Commits on Apr 14, 2026