Skip to content

[ Misc ] non-uniform quantization via compressed-tensors for Llama#6515

Merged
robertgshaw2-redhat merged 28 commits intovllm-project:mainfrom
neuralmagic:non-uniform
Jul 19, 2024
Merged

[ Misc ] non-uniform quantization via compressed-tensors for Llama#6515
robertgshaw2-redhat merged 28 commits intovllm-project:mainfrom
neuralmagic:non-uniform

Commits

Commits on Jul 17, 2024

Commits on Jul 18, 2024