CUDA: optimize FA for GQA + large batches#12014
Merged
JohannesGaessler merged 1 commit intoggml-org:masterfrom Feb 22, 2025 
Merged
CUDA: optimize FA for GQA + large batches#12014JohannesGaessler merged 1 commit intoggml-org:masterfrom 
JohannesGaessler merged 1 commit intoggml-org:masterfrom