Skip to content
This repository has been archived by the owner on Jul 17, 2024. It is now read-only.

Releases: instill-ai/model-mistral-7b-dvc

for-test

07 Jan 06:55
9debb52
Compare
Choose a tag to compare
for-test Pre-release
Pre-release

for-test

fp16-7b-vllm-a100

09 Nov 10:44
203ed78
Compare
Choose a tag to compare

fp16-7b-vllm-a100

fp16-7b-vllm-p80-2gpu

21 Oct 17:19
5bbd606
Compare
Choose a tag to compare

Support Mistral-7b Text Completion Task via vLLM in Triton Inference Server's Python Operator, running in parallel with 2 GPU instances, each utilizing 80% of GPU memory.

fp16-7b-vllm-p80-1gpu

21 Oct 17:17
fd0cce1
Compare
Choose a tag to compare

Support Mistral-7b Text Completion Task via vLLM in Triton Inference Server's Python Operator, running not in parallel with only 1 gpu instance with utilizing 80% of GPU memory.