Skip to content

GPU-accelerated token generation (new quantization format)#1412

Merged
ggerganov merged 9 commits intoggml-org:masterfrom
JohannesGaessler:dequantize-matmul-4
May 13, 2023
Merged

GPU-accelerated token generation (new quantization format)#1412
ggerganov merged 9 commits intoggml-org:masterfrom
JohannesGaessler:dequantize-matmul-4

Commits