
llama.cpp with the BakLLaVA model describes what it sees


Anon2578/realtime-bakllava

 
 


🍰 Bakllava Llama C++ Tutorial 🦙

Welcome to the delicious world of Bakllava Llama with C++! Follow these steps to get your code running and indulge in AI sweetness! 😋

🚨 Properly tested only on Apple silicon

YouTube installation guide

Similar project: "Be My Eyes" web app

🚀 Step 1: Install Llama C++

First things first, let's get llama.cpp installed.

🔗 Clone the repository from GitHub:

git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp

On Linux & macOS:

🛠 Build with make:

make

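🏗 Or, if you prefer cmake (configure into a build directory first, then build):

cmake -B build
cmake --build build --config Release

With cmake, the server binary is written to build/bin instead of the repository root.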

📦 Step 2: Download the Model!

  1. 📥 Download these 2 files from Hugging Face - mys/ggml_bakllava-1:
  • 🌟 ggml-model-q4_k.gguf (or any other quantized model) - only one quantized model file is required!
  • 🧊 mmproj-model-f16.gguf
  2. ✂️ Copy the paths of those 2 files.

  3. 🏃‍♂️ Run this in the llama.cpp repository (replace YOUR_PATH with the path to the folder holding the files you downloaded):

    macOS

    ./server -m YOUR_PATH/ggml-model-q4_k.gguf --mmproj YOUR_PATH/mmproj-model-f16.gguf -ngl 1

    Windows

    server.exe -m YOUR_PATH\ggml-model-q4_k.gguf --mmproj YOUR_PATH\mmproj-model-f16.gguf -ngl 1

  4. 🎉 The llama server is now up and running!

    ⚠️ NOTE: Keep the server running in the background.

  5. 📹 Let's run the script to use the webcam or send it a single picture!
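
Before starting the demo, it can help to confirm that the server is actually accepting connections. The sketch below is a minimal check, assuming the server listens on 127.0.0.1:8080 (adjust if you passed a different host or port). The helper name `server_is_listening` is made up for this example and is not part of llama.cpp or the demo repository.

```python
import socket


def server_is_listening(host: str = "127.0.0.1", port: int = 8080, timeout: float = 2.0) -> bool:
    """Return True if something accepts TCP connections on host:port.

    Assumption: the llama.cpp server runs on its default host and port.
    This only checks that the port is open, not that the model has loaded.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Connection refused, timed out, or host unreachable.
        return False


if __name__ == "__main__":
    if server_is_listening():
        print("Server is reachable, continue with Step 3.")
    else:
        print("Nothing is listening yet; check the terminal running the server.")
```

A plain TCP check is used here because it does not depend on any particular server endpoint, so it should keep working even if the HTTP API changes between llama.cpp versions.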

🏃‍♀️ Step 3: Running the Demo

Open a new terminal window and clone the demo app:

git clone https://github.com/Fuzzy-Search/realtime-bakllava.git
cd realtime-bakllava

🛠 Create and activate a new Python virtual environment (optional), then install the requirements:

python3 -m venv bakllava-venv
source bakllava-venv/bin/activate
pip3 install -r requirements.txt

🎥 Webcam Script

To start streaming from your webcam:

python3 src/video_stream.py

⚠️ If you run into a problem with the FFmpeg library, download the source code and modify the second line of src/video_stream.py.

🖼 Simple Picture Drop


pip3 install -r picture_requirements.txt
python3 src/picture_drop.py --path src/sample_pic.png
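
For a feel of what a script like this has to do, here is a rough sketch of sending one image to the running server. It assumes the server's /completion endpoint accepts an `image_data` list of base64 images that the prompt references with an `[img-ID]` tag, as the llama.cpp server documentation described for LLaVA-style models; check the README of your llama.cpp version, since this may differ. The server address, prompt text, image id, `n_predict` value, and both function names are illustrative choices, not taken from this repository.

```python
import base64
import json
import urllib.request

# Assumption: default llama.cpp server address; change it if you passed --host/--port.
SERVER_URL = "http://127.0.0.1:8080/completion"


def build_payload(image_bytes: bytes, question: str = "Describe the image in detail.") -> dict:
    """Build a /completion request body that references one base64-encoded image."""
    image_id = 10  # arbitrary id; it only has to match the [img-10] tag in the prompt
    return {
        "prompt": f"USER:[img-{image_id}]{question}\nASSISTANT:",
        "n_predict": 128,  # illustrative cap on the number of generated tokens
        "image_data": [
            {"data": base64.b64encode(image_bytes).decode("ascii"), "id": image_id}
        ],
    }


def describe_image(path: str) -> str:
    """POST the image at `path` to the server and return the generated text."""
    with open(path, "rb") as f:
        payload = build_payload(f.read())
    request = urllib.request.Request(
        SERVER_URL,
        data=json.dumps(payload).encode("utf-8"),  # passing data makes this a POST
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["content"]


if __name__ == "__main__":
    print(describe_image("src/sample_pic.png"))
```

Only the standard library is used so the sketch runs without extra installs; the actual src/picture_drop.py may use different libraries and prompts.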

📝 Enjoy your adventure with Llama C++! 🚀🦙
