curname

0 followers · 6 following

Achievements

Popular repositories

AutoAWQ Public

Forked from casper-hansen/AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.

Python

vllm-gptq Public

Forked from chu-tianxiang/vllm-gptq

A high-throughput and memory-efficient inference and serving engine for LLMs

Python