Skip to content

Introduction of gemm4xN and gemmMx4 for Q4_0 and Q8_0 for better performance results#8908

Merged
ggerganov merged 1 commit intoggml-org:masterfrom
Srihari-mcw:q8_0_q4_0_fp16_delta_multiply_parallel
Aug 31, 2024
Merged

Introduction of gemm4xN and gemmMx4 for Q4_0 and Q8_0 for better performance results#8908
ggerganov merged 1 commit intoggml-org:masterfrom
Srihari-mcw:q8_0_q4_0_fp16_delta_multiply_parallel

Commits