Shanghai Jiao Tong University

Pinned
- intel/neural-compressor (Public)
  SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime. (Usage sketch after this list.)
- EleutherAI/lm-evaluation-harness (Public)
  A framework for few-shot evaluation of language models. (Usage sketch after this list.)
- intel/intel-extension-for-transformers (Public archive)
  ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms ⚡
- optimum (Public, forked from huggingface/optimum, Python)
  🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy-to-use hardware optimization tools
- optimum-intel (Public, forked from huggingface/optimum-intel, Jupyter Notebook)
  Accelerate inference of 🤗 Transformers with Intel optimization tools
- transformers (Public, forked from huggingface/transformers, Python)
  🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch, TensorFlow, and JAX.
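
For intel/neural-compressor, a minimal post-training INT8 quantization sketch, assuming the Neural Compressor 2.x API (`PostTrainingQuantConfig` and `quantization.fit`); the toy model and calibration dataloader below are illustrative stand-ins, not part of the library:

```python
import torch
from torch.utils.data import DataLoader, TensorDataset
from neural_compressor import PostTrainingQuantConfig, quantization  # assumes INC 2.x

# Toy FP32 model and calibration data; replace with a real model and dataset.
fp32_model = torch.nn.Sequential(
    torch.nn.Linear(16, 32),
    torch.nn.ReLU(),
    torch.nn.Linear(32, 4),
)
calib_loader = DataLoader(
    TensorDataset(torch.randn(64, 16), torch.zeros(64, dtype=torch.long)),
    batch_size=8,
)

# Post-training static INT8 quantization driven by a short calibration pass.
q_model = quantization.fit(
    model=fp32_model,
    conf=PostTrainingQuantConfig(),
    calib_dataloader=calib_loader,
)
q_model.save("./quantized_model")
```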
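
For EleutherAI/lm-evaluation-harness, a minimal sketch assuming the v0.4+ Python entry point `lm_eval.evaluator.simple_evaluate` with the Hugging Face (`hf`) backend; the model name and task are examples only:

```python
from lm_eval import evaluator  # assumes lm-evaluation-harness v0.4+

# Zero-shot evaluation of a small HF model on one task.
results = evaluator.simple_evaluate(
    model="hf",
    model_args="pretrained=EleutherAI/pythia-160m",
    tasks=["lambada_openai"],
    num_fewshot=0,
    batch_size=8,
)
print(results["results"])  # per-task metrics (accuracy, perplexity, ...)
```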