Skip to content

CUDA: optimize FA for GQA + large batches#12014

Merged
JohannesGaessler merged 1 commit intoggml-org:masterfrom
JohannesGaessler:cuda-fa-mma-23
Feb 22, 2025
Merged

CUDA: optimize FA for GQA + large batches#12014
JohannesGaessler merged 1 commit intoggml-org:masterfrom
JohannesGaessler:cuda-fa-mma-23

Commits

Commits on Feb 21, 2025