Skip to content

CUDA: Improve performance via less synchronizations between token#17795

Merged
ggerganov merged 13 commits intoggml-org:masterfrom
aendk:akieslinger/reduce-per-token-syncs
Mar 5, 2026
Merged

CUDA: Improve performance via less synchronizations between token#17795
ggerganov merged 13 commits intoggml-org:masterfrom
aendk:akieslinger/reduce-per-token-syncs

Commits

Commits on Feb 9, 2026