Skip to content

Add GPTQ quantization kernels for 2, 3, 8-bit use cases#2223

Closed
JasonZhu1313 wants to merge 8 commits into
vllm-project:mainfrom
JasonZhu1313:JasonZhu1313/gptq_cuda_triton
Closed

Add GPTQ quantization kernels for 2, 3, 8-bit use cases#2223
JasonZhu1313 wants to merge 8 commits into
vllm-project:mainfrom
JasonZhu1313:JasonZhu1313/gptq_cuda_triton

Commits

Commits on Dec 20, 2023

Commits on Dec 21, 2023