Pinned Loading
-
easy-llama
easy-llama PublicPython package wrapping llama.cpp for on-device LLM inference
-
reap
reap PublicForked from CerebrasResearch/reap
REAP: Router-weighted Expert Activation Pruning for SMoE compression
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.


