Skip to content

perf: turbo VEC flash attention — +9% decode on CUDA via autoresearch#53

Open
signalnine wants to merge 153 commits into
TheTom:feature/turboquant-kv-cachefrom
signalnine:pr/fattn-vec-turbo-opts
Open

perf: turbo VEC flash attention — +9% decode on CUDA via autoresearch#53
signalnine wants to merge 153 commits into
TheTom:feature/turboquant-kv-cachefrom
signalnine:pr/fattn-vec-turbo-opts

Commits

Commits on Apr 2, 2026

Commits on Apr 3, 2026

Commits on Apr 6, 2026

Commits on Apr 7, 2026

Commits on Apr 8, 2026

Commits on Apr 9, 2026