Open-source tool for generating descriptive image captions and alt-text using AI. Built with Hugging Face transformers and Gradio.
- 🖼️ Image Caption Generation: Generate descriptive captions for images using AI
- 🚀 Gradio Interface: Fast, modern AI-Powered apps interface with the API ready feature
- 🤗 Hugging Face Integration: Uses state-of-the-art BLIP model for image captioning
- 📦 Batch Processing: Support for processing multiple images at once
- 🔥 GPU Support: Automatic GPU acceleration when available
- Clone the repository:
git clone https://github.com/alinrajpoot/captnr.git
cd captnr
- Install dependencies:
pip install -r requirements.txt
- Run the server:
python main.py
The API will be available at http://localhost:9000
Once the server is running, visit http://localhost:9000 in browser.
- Python 3.8+
- Gradio
- Hugging Face Transformers
- PyTorch
- Pillow (PIL)
This tool uses the Salesforce/blip-image-captioning-base
model from Hugging Face, which provides high-quality image captions suitable for accessibility and content description purposes.
Open source - feel free to contribute and improve!