CUDA backend: 3-bit uniform KV cache (turbo3, 4.6x compression, 96% f16 speed) #15
Closed
nalditopr wants to merge 95 commits into
Commits
Commits on Mar 26, 2026
Commits on Mar 27, 2026
- experiment: named-register centroid×norm — 4 constant reads upfront, zero divergence, ternary select
- Merge remote-tracking branch 'upstream/feature/turboquant-kv-cache' into feature/turboquant-kv-cache
Commits on Mar 28, 2026