CUDA: optimize and refactor MMQ #8416

Merged
JohannesGaessler merged 2 commits into ggml-org:master from JohannesGaessler:cuda-mmq-256k-5
Jul 11, 2024

Commits

Commits on Jul 10, 2024

Commits on Jul 11, 2024