CUDA: route batch>=4 quantized matmul to MMQ on AMD MFMA hardware #23227

Merged
JohannesGaessler merged 3 commits into ggml-org:master from jadenmach2:cdna-mmq-batch4 on May 28, 2026

Commits

Commits on May 19, 2026

Commits on May 21, 2026