Moondream Streamlit Interface

A web interface for the Moondream vision language model, built with Streamlit. Upload images, get descriptions, and chat about the images using natural language.

Features

🖼️ Image upload and analysis
💬 Interactive chat about images
🚀 Local processing for privacy
🎯 CUDA support for faster processing
📥 Automatic model weight downloading with progress tracking

Prerequisites

Python 3.8+
NVIDIA GPU with CUDA support (recommended)
Microsoft Visual C++ Redistributable 2019 (Download here)
~2GB disk space for model weights

Installation

Clone the Repository

git clone https://github.com/yourusername/moondream-streamlit.git
cd moondream-streamlit

Create and Activate Virtual Environment

# Remove existing venv if present
deactivate  # If in a virtual environment
rmdir /s /q venv  # On Windows
rm -rf venv      # On Linux/Mac

# Create new environment
python -m venv venv

# Activate environment
# On Windows:
venv\Scripts\activate
# On Linux/Mac:
source venv/bin/activate

Install PyTorch with CUDA Support

pip3 install torch==2.5.1+cu121 torchvision==0.20.1+cu121 --index-url https://download.pytorch.org/whl/cu121

Install Other Dependencies

pip install -r requirements.txt

Download Model Weights

# Download gzipped weights
wget https://huggingface.co/vikhyatk/moondream2/resolve/client/moondream-latest-int8.bin.gz

Important Note About Model Format

The Moondream library expects the model in a specific TAR archive format. The application handles format conversion automatically:

When starting up, it checks for and processes the model files in this order:

First looks for the gzipped file (moondream-latest-int8.bin.gz)
- If found, extracts it to a .bin file
Then looks for the extracted .bin file (moondream-latest-int8.bin)
- If found, processes it by: a. Extracting contents to a temporary directory b. Creating a proper TAR archive with the required structure c. Cleaning up temporary files

The final TAR archive will contain:

vision_encoder.onnx
vision_projection.onnx
text_encoder.onnx
text_decoder files
tokenizer.json
initial_kv_caches.npy
config.json

The resulting file will be named moondream-latest-mtb.tar.

Running the Application

Start the Streamlit server:

streamlit run app/main.py

The application will be available at http://localhost:8501

On first run, if you haven't already downloaded the model weights, the application will automatically download them (~2GB) with a progress bar. This may take a few minutes depending on your internet connection.

Project Structure

moondream-streamlit/
├── app/
│   ├── components/
│   │   ├── chat_interface.py    # Chat UI component
│   │   └── image_uploader.py    # Image upload component
│   ├── utils/
│   │   ├── logger.py           # Logging configuration
│   │   └── moondream_integration.py  # Moondream model wrapper
│   └── main.py                 # Main Streamlit application
├── logs/                       # Application logs
├── moondream-latest-int8.bin   # Model weights (downloaded on first run, after extraction)
├── moondream-latest-mtb.tar    # Converted model archive (created automatically)
├── requirements.txt            # Project dependencies
└── README.md                   # This file

Usage

Launch the application using the steps above
Upload an image using the file uploader
Wait for the image description to generate
Ask questions about the image using the chat interface
View the chat history below the input field

Troubleshooting CUDA Issues

Ensure you have the correct NVIDIA drivers installed
Verify CUDA installation with torch.cuda.is_available()
Check GPU compatibility with CUDA 12.1

ONNX DLL Error

Install Microsoft Visual C++ Redistributable 2019
Restart your computer after installation
Check Windows Event Viewer for detailed error messages

Logging The application logs are stored in the logs/ directory with timestamps. Each session creates a new log file with the format: app_YYYYMMDD_HHMMSS.log

Contributing Feel free to open issues or submit pull requests with improvements.

License MIT License - feel free to use and modify as needed.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
app		app
.gitignore		.gitignore
download_model.py		download_model.py
readme.md		readme.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Moondream Streamlit Interface

Features

Prerequisites

Installation

Important Note About Model Format

Running the Application

Project Structure

About

Releases

Packages

Languages

parsakhaz/moondream-pip

Folders and files

Latest commit

History

Repository files navigation

Moondream Streamlit Interface

Features

Prerequisites

Installation

Important Note About Model Format

Running the Application

Project Structure

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages