[ Misc ] non-uniform quantization via compressed-tensors for Llama#6515
Merged
robertgshaw2-redhat merged 28 commits intovllm-project:mainfrom Jul 19, 2024
Merged
[ Misc ] non-uniform quantization via compressed-tensors for Llama#6515robertgshaw2-redhat merged 28 commits intovllm-project:mainfrom
compressed-tensors for Llama#6515robertgshaw2-redhat merged 28 commits intovllm-project:mainfrom
Commits
Commits on Jul 17, 2024
- committed
rshaw@neuralmagic.com - committed
rshaw@neuralmagic.com - committed
rshaw@neuralmagic.com - committed
rshaw@neuralmagic.com - committed
rshaw@neuralmagic.com - committed
rshaw@neuralmagic.com - committed
rshaw@neuralmagic.com - committed
rshaw@neuralmagic.com - committed
rshaw@neuralmagic.com - committed
rshaw@neuralmagic.com - committed
rshaw@neuralmagic.com
Commits on Jul 18, 2024
- committed
rshaw@neuralmagic.com - committed
rshaw@neuralmagic.com - committed
rshaw@neuralmagic.com - committed
rshaw@neuralmagic.com - committed
rshaw@neuralmagic.com - committed
rshaw@neuralmagic.com - committed
rshaw@neuralmagic.com - committed
rshaw@neuralmagic.com - committed
rshaw@neuralmagic.com - committed
rshaw@neuralmagic.com - committed
rshaw@neuralmagic.com - committed
rshaw@neuralmagic.com - committed
rshaw@neuralmagic.com - committed
rshaw@neuralmagic.com - committed
rshaw@neuralmagic.com - committed
rshaw@neuralmagic.com - committed
rshaw@neuralmagic.com