CUDA: route batch>=4 quantized matmul to MMQ on AMD MFMA hardware #23227
+51
−0
background
wait
wait-all
cancel
Loading