Skip to content

Commit 1c286c3

Browse files
authored
docs(mixtral): add mixtral example (#1449)
1 parent 2f7beb6 commit 1c286c3

File tree

4 files changed

+35
-0
lines changed

4 files changed

+35
-0
lines changed

examples/configurations/README.md

+17
Original file line numberDiff line numberDiff line change
@@ -64,4 +64,21 @@ wget https://huggingface.co/mys/ggml_bakllava-1/resolve/main/mmproj-model-f16.gg
6464
curl http://localhost:8080/v1/chat/completions -H "Content-Type: application/json" -d '{
6565
"model": "llava",
6666
"messages": [{"role": "user", "content": [{"type":"text", "text": "What is in the image?"}, {"type": "image_url", "image_url": {"url": "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg" }}], "temperature": 0.9}]}'
67+
68+
```
69+
70+
### Mixtral
71+
72+
```
73+
cp -r examples/configuration/mixtral/* models/
74+
wget https://huggingface.co/TheBloke/Mixtral-8x7B-Instruct-v0.1-GGUF/resolve/main/mixtral-8x7b-instruct-v0.1.Q2_K.gguf -O models/mixtral-8x7b-instruct-v0.1.Q2_K.gguf
75+
```
76+
77+
#### Test it out
78+
79+
```
80+
curl http://localhost:8080/v1/completions -H "Content-Type: application/json" -d '{
81+
"model": "mixtral",
82+
"prompt": "How fast is light?",
83+
"temperature": 0.1 }'
6784
```
+1
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
[INST] {{.Input}} [/INST]
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
[INST] {{.Input}} [/INST]
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,16 @@
1+
context_size: 512
2+
f16: true
3+
threads: 11
4+
gpu_layers: 90
5+
name: mixtral
6+
mmap: true
7+
parameters:
8+
model: mixtral-8x7b-instruct-v0.1.Q2_K.gguf
9+
temperature: 0.2
10+
top_k: 40
11+
top_p: 0.95
12+
batch: 512
13+
tfz: 1.0
14+
template:
15+
chat: mixtral-chat
16+
completion: mixtral

0 commit comments

Comments
 (0)