- Overview
- Demo
- Technical Flow Chart
- Key Features
- Tech Stack
- Installation and Setup
- Usage
- Contributions
- License
- Citing
- Contact
The Multi-Agent Medical Assistant is an AI-powered chatbot designed to assist with medical diagnosis, research, and patient interactions.
🚀 Powered by Multi-Agent Intelligence, this system integrates:
- 🤖 Large Language Models (LLMs)
- 🖼️ Computer Vision Models for medical imaging analysis
- 📚 Retrieval-Augmented Generation (RAG) leveraging vector databases
- 🌐 Real-time Web Search for up-to-date medical insights
- 👨‍⚕️ Human-in-the-Loop Validation to verify AI-based medical image diagnoses
🔹 👨‍💻 Multi-Agent Orchestration with structured graph workflows
🔹 🔍 Advanced RAG Techniques – hybrid retrieval, semantic chunking, and vector search
🔹 ⚡ Confidence-Based Routing & Agent-to-Agent Handoff
🔹 🔒 Scalable, Production-Ready AI with Modularized Code & Robust Exception Handling
📂 For learners: Check out agents/README.md for a detailed breakdown of the agentic workflow! 🎯
Multi-Agent-Medical-Assistant-v1-with-voiceover-compressed.mp4
If you like what you see and would like to support the project's developer, you can sponsor the project! :)
📂 For an even more detailed demo video, check out Multi-Agent-Medical-Assistant-v1.9. 📽️
- 🤖 **Multi-Agent Architecture**: Specialized agents working in harmony to handle diagnosis, information retrieval, reasoning, and more.
- 🔍 **Advanced RAG Retrieval System**:
  - Unstructured.io parsing to extract and embed text along with tables from PDFs.
  - Semantic chunking with structural boundary awareness.
  - Qdrant hybrid search combining BM25 sparse keyword search with dense embedding vector search.
  - Query expansion with related terms to enhance search results.
  - Metadata enrichment to add context and improve search accuracy.
  - Input-output guardrails to ensure safe and relevant responses.
  - Confidence-based agent-to-agent handoff between RAG and web search to prevent hallucinations.
  - Supported file types for RAG ingestion and retrieval: .txt, .csv, .json, .pdf.
- 🏥 **Medical Imaging Analysis**:
  - Brain Tumor Detection (TBD)
  - Chest X-ray Disease Classification
  - Skin Lesion Segmentation
- 🌐 **Real-time Research Integration**: Web search agent that retrieves the latest medical research papers and findings.
- 📊 **Confidence-Based Verification**: Log probability analysis ensures high accuracy in medical recommendations.
- 🎙️ **Voice Interaction Capabilities**: Seamless speech-to-text and text-to-speech powered by the Eleven Labs API.
- 👩‍⚕️ **Expert Oversight System**: Human-in-the-loop verification by medical professionals before finalizing outputs.
- ⚔️ **Input & Output Guardrails**: Ensures safe, unbiased, and reliable medical responses while filtering out harmful or misleading content.
- 💻 **Intuitive User Interface**: Designed for healthcare professionals with minimal technical expertise.
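The confidence-based verification and the RAG-to-web-search handoff both hinge on the LLM's token log probabilities. The following is only a minimal sketch of how such routing could work; the threshold value and agent names are illustrative, not the repository's actual implementation:

```python
import math

def average_confidence(token_logprobs):
    """Mean per-token probability derived from the LLM's log probabilities."""
    if not token_logprobs:
        return 0.0
    return sum(math.exp(lp) for lp in token_logprobs) / len(token_logprobs)

def route(token_logprobs, threshold=0.85):
    """Keep the RAG answer when confidence is high; hand off to web search otherwise."""
    confidence = average_confidence(token_logprobs)
    return "RAG_AGENT" if confidence >= threshold else "WEB_SEARCH_AGENT"
```

High log probabilities (close to 0) indicate the model is confident the retrieved context supports its answer; very negative values suggest the answer is uncertain and fresher web results should be consulted instead.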
Note
Upcoming features:
- Brain Tumor Medical Computer Vision model integration.
- Streaming of LLM responses to UI.
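The semantic chunking with structural boundary awareness listed under Key Features can be approximated in a few lines. This sketch packs paragraphs (split at blank lines) into size-bounded chunks without ever cutting a paragraph in half; the 500-character limit is illustrative, and the real pipeline uses embedding-aware boundaries on top of this idea:

```python
def semantic_chunks(text, max_chars=500):
    """Group paragraphs into chunks, splitting only at structural boundaries."""
    paragraphs = [p.strip() for p in text.split("\n\n") if p.strip()]
    chunks, current = [], ""
    for para in paragraphs:
        # Start a new chunk if adding this paragraph would exceed the budget.
        if current and len(current) + len(para) + 2 > max_chars:
            chunks.append(current)
            current = para
        else:
            current = f"{current}\n\n{para}" if current else para
    if current:
        chunks.append(current)
    return chunks
```

Respecting paragraph boundaries keeps each retrieved chunk self-contained, which improves both embedding quality and the readability of cited context.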
| Component | Technologies |
|---|---|
| 🔹 Backend Framework | FastAPI, Flask |
| 🔹 Agent Orchestration | LangGraph |
| 🔹 Knowledge Storage | Qdrant Vector Database |
| 🔹 Medical Imaging | Computer Vision Models:<br>• Brain Tumor: Object Detection (PyTorch)<br>• Chest X-ray: Image Classification (PyTorch)<br>• Skin Lesion: Semantic Segmentation (PyTorch) |
| 🔹 Guardrails | LangChain |
| 🔹 Speech Processing | Eleven Labs API |
| 🔹 Frontend | HTML, CSS, JavaScript |
| 🔹 Deployment | Docker, CI/CD Pipeline |
```bash
git clone https://github.com/souvikmajumder26/Multi-Agent-Medical-Assistant.git
cd Multi-Agent-Medical-Assistant
```
- Create a `.env` file and add the following API keys:
Note

You may use any LLM and embedding model of your choice...
- If using Azure OpenAI, no modification is required.
- If using direct OpenAI, modify the LLM and embedding model definitions in 'config.py' and provide the appropriate env variables.
- If using local models, appropriate code changes might be required throughout the codebase, especially in 'agents'.

Warning

If all necessary env variables are not provided, errors will be thrown in the console.
```
# LLM Configuration (Azure OpenAI - gpt-4o used in development)
# If using any other LLM API key or local LLM, appropriate code modification is required
deployment_name =
model_name = gpt-4o
azure_endpoint =
openai_api_key =
openai_api_version =

# Embedding Model Configuration (Azure OpenAI - text-embedding-ada-002 used in development)
# If using any other embedding model, appropriate code modification is required
embedding_deployment_name =
embedding_model_name = text-embedding-ada-002
embedding_azure_endpoint =
embedding_openai_api_key =
embedding_openai_api_version =

# Speech API Key (Free credits available with new Eleven Labs Account)
ELEVEN_LABS_API_KEY =

# Web Search API Key (Free credits available with new Tavily Account)
TAVILY_API_KEY =

# Hugging Face Token - using reranker model "ms-marco-TinyBERT-L-6"
HUGGINGFACE_TOKEN =

# (OPTIONAL) If using Qdrant server version, local does not require API key
QDRANT_URL =
QDRANT_API_KEY =
```
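As the Warning above notes, missing env variables surface as console errors at runtime. One way to fail fast with a clear message is a validation pattern like the following; the variable names mirror the template above, but the function itself is an illustrative sketch, not the repository's actual config.py:

```python
import os

# Names taken from the .env template; extend as needed for your setup.
REQUIRED_VARS = [
    "deployment_name", "model_name", "azure_endpoint",
    "openai_api_key", "openai_api_version",
    "ELEVEN_LABS_API_KEY", "TAVILY_API_KEY",
]

def load_settings():
    """Raise one clear error listing every missing variable, instead of
    failing later with a cryptic runtime exception."""
    missing = [v for v in REQUIRED_VARS if not os.getenv(v)]
    if missing:
        raise EnvironmentError(
            f"Missing required env variables: {', '.join(missing)}"
        )
    return {v: os.getenv(v) for v in REQUIRED_VARS}
```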
```bash
docker-compose up -d
```
This will start two services:
- `fastapi-backend`: Runs the FastAPI backend on port 8000
- `main-app`: Runs the main application (`app.py`)
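The repository ships its own docker-compose.yml; purely for orientation, a file wiring these two services together might look roughly like this (the build contexts, commands, and port mapping are assumptions based on the description above, not the actual file):

```yaml
# Illustrative sketch only - see the repository's docker-compose.yml
services:
  fastapi-backend:
    build: .
    command: uvicorn api.fastapi_backend:app --host 0.0.0.0 --port 8000
    ports:
      - "8000:8000"
    env_file: .env
  main-app:
    build: .
    command: python app.py
    depends_on:
      - fastapi-backend
    env_file: .env
```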
```bash
# To ingest a single file
docker-compose run --rm fastapi-backend ingest --file ./data/raw/your_file.pdf

# To ingest all files in a directory
docker-compose run --rm fastapi-backend ingest --dir ./data/raw
```
The application will be available at: http://localhost:8000
```bash
docker-compose down
```
```bash
git clone https://github.com/souvikmajumder26/Multi-Agent-Medical-Assistant.git
cd Multi-Agent-Medical-Assistant
```
- If using conda:

```bash
conda create --name <environment-name> python=3.11
conda activate <environment-name>
```

- If using python venv:

```bash
python -m venv <environment-name>
source <environment-name>/bin/activate   # For Mac/Linux
<environment-name>\Scripts\activate      # For Windows
```
Important
- ffmpeg is required for speech service to work.
- Poppler and Tesseract OCR are essential for table extraction from PDFs using Unstructured.IO.
- To install poppler and tesseract OCR for Ubuntu/Debian/macOS:
```bash
# if on Ubuntu/Debian
sudo apt-get update
sudo apt-get install -y poppler-utils tesseract-ocr

# if on macOS
brew install poppler tesseract
```
- Install Poppler for Windows:
  - Download the latest Poppler release for Windows from: https://github.com/oschwartz10612/poppler-windows/releases/
  - Extract the ZIP file to a location on your computer (e.g., 'C:\Program Files\poppler')
  - Add the bin directory to your PATH environment variable (e.g., 'C:\Program Files\poppler\bin')
- Install Tesseract OCR for Windows:
  - Download the Tesseract installer from: https://github.com/UB-Mannheim/tesseract/wiki
  - Run the installer and complete the installation; by default, it installs to 'C:\Program Files\Tesseract-OCR'
  - Make sure to add it to your PATH during installation, or add it manually afterward
- Verify your installation:
  - Open a new command prompt (to ensure it has the updated PATH)
  - Run 'tesseract --version' to verify Tesseract is properly installed
  - Run 'pdfinfo -h' or 'pdftoppm -h' to verify Poppler is properly installed
- If using conda:

```bash
conda install -c conda-forge ffmpeg
pip install -r requirements.txt
```

- If using python venv:

```bash
winget install ffmpeg
pip install -r requirements.txt
```

- Might be required:

```bash
pip install "unstructured[pdf]"
```
- Create a `.env` file and add the required API keys as shown in Option 1.
- Run the following commands one after another in separate terminal windows, using the same directory and virtual environment. Keep both running simultaneously.
```bash
uvicorn api.fastapi_backend:app --reload
python app.py
```
- Run either of the following commands as required: the first ingests one document at a time, the second ingests all documents from a directory.
```bash
python ingest_rag_data.py --file ./data/raw/brain_tumors_ucni.pdf
python ingest_rag_data.py --dir ./data/raw
```
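The ingestion script's command-line interface follows a common argparse pattern. Here is an illustrative sketch of it: the `--file`/`--dir` flags and supported extensions match the usage above, but `collect_paths` and the parser details are assumptions, not the actual ingest_rag_data.py:

```python
import argparse
from pathlib import Path

def build_parser():
    """CLI mirroring the two ingestion modes: single file or whole directory."""
    parser = argparse.ArgumentParser(
        description="Ingest documents into the RAG vector store."
    )
    group = parser.add_mutually_exclusive_group(required=True)
    group.add_argument("--file", type=Path, help="Path to a single document")
    group.add_argument("--dir", type=Path, help="Directory of documents to ingest")
    return parser

def collect_paths(args):
    """Resolve the CLI arguments to a list of ingestible file paths."""
    supported = {".txt", ".csv", ".json", ".pdf"}  # from the supported-types list
    if args.file:
        return [args.file]
    return [p for p in sorted(args.dir.iterdir()) if p.suffix.lower() in supported]
```

The mutually exclusive group ensures exactly one of the two modes is chosen, so `python ingest_rag_data.py` with neither flag (or with both) exits with a usage error.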
The vector database data is stored in Docker volumes:
- `vector-db-processed`: Contains data from the `data/processed` directory
- `vector-db-qdrant`: Contains data from the `data/qdrantdb` directory
- `upload-data`: Contains uploaded files in the `uploads` directory
This ensures your data persists even if you remove the containers.
- If the containers aren't starting properly, check the logs:

```bash
docker-compose logs fastapi-backend
docker-compose logs main-app
```
- Make sure all required environment variables are set in the `.env` file
- To completely clean up and restart:

```bash
docker-compose down -v
docker-compose up -d --build
```
Note

- The first run may be slow and throw errors - be patient and check the console for ongoing downloads and installations.
- On the first run, several models are downloaded: the YOLO layout model used alongside Tesseract OCR, the computer vision agent models, the cross-encoder reranker model, etc.
- Once the downloads complete, retry. Everything should then work seamlessly, as all of it is thoroughly tested.
- Upload medical images for AI-based diagnosis by the task-specific Computer Vision agents - try it out with images from the 'sample_images' folder.
- Ask medical queries: the assistant answers via retrieval-augmented generation (RAG) when the information is in its knowledge base, or via web search to retrieve the latest information.
- Use voice-based interaction (speech-to-text and text-to-speech).
- Review AI-generated insights with human-in-the-loop verification.
Contributions are welcome! Please check the issues tab for feature requests and improvements.
This project is licensed under the Apache-2.0 License. See the LICENSE file for details.
```bibtex
@misc{Souvik2025,
  Author = {Souvik Majumder},
  Title = {Multi Agent Medical Assistant},
  Year = {2025},
  Publisher = {GitHub},
  Journal = {GitHub repository},
  Howpublished = {\url{https://github.com/souvikmajumder26/Multi-Agent-Medical-Assistant}}
}
```
For any questions or collaboration inquiries, reach out to Souvik Majumder on:
🔗 LinkedIn: https://www.linkedin.com/in/souvikmajumder26
🔗 GitHub: https://github.com/souvikmajumder26