We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent dd5ae06 commit a9a8c5dCopy full SHA for a9a8c5d
README.md
@@ -10,6 +10,7 @@ Inference of [LLaMA](https://arxiv.org/abs/2302.13971) model in pure C/C++
10
11
### Hot topics
12
13
+- New SOTA quantized models, including pure 2-bits: https://huggingface.co/ikawrakow
14
- Collecting Apple Silicon performance stats:
15
- M-series: https://github.com/ggerganov/llama.cpp/discussions/4167
16
- A-series: https://github.com/ggerganov/llama.cpp/discussions/4508
0 commit comments