GitHub - ainilaha/EpiMind: Large Language Model (LLM) for Epidemiology with Retrieval-Augmented Generation (RAG)

EpiMind - Generative AI for Epidemiology

Large Language Model(LLM) for Epidemiology with RAG (Retrieval-Augmented Generation)

Retrive data from the CDC website
- convert html doc to markdown doc (Done)
- extract the data from the markdown doc
- store the data in a vector database pgvector of Postgresql to perform vector searches in LlamaIndex
RAG model
- RAG model for epidemiology
- Named Entity Recognition(NER) for epidemiology
- Improve search and generation(KNN,ANN,PCA, LDA etc)
- knowledge graph for epidemiology
- Agentic-flow model for epidemiology

Meeting Agenda

Help RG to setup the environment
Save data into local or container during develop stage
How to split (chunk) the text? split based on title would always work (eg too long or too short)
Embedding models for the text

Setup the Environment

docker \
    run \
        --name epimind \
        -d \
        -v <YOUR LOCAL CODE PATH>:/root/ \
        -p 5432:5432 \
        -p 8888:8888 \
        -e POSTGRES_PASSWORD=password \
        ainilaha/epimind:v0.0.3

Try to access: http://localhost:8888

You can interact with docker container with:

docker exec -it epimind /bin/bash

Also, you can build with the image locally:

You can build the image locally with:

docker build -t epimind:lastest . the local building would much smaller.

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
.github/workflows		.github/workflows
Docker		Docker
crawler		crawler
data		data
pgvector		pgvector
rag		rag
.gitignore		.gitignore
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EpiMind - Generative AI for Epidemiology

Large Language Model(LLM) for Epidemiology with RAG (Retrieval-Augmented Generation)

Meeting Agenda

Setup the Environment

About

Releases

Packages

Contributors 2

Languages

ainilaha/EpiMind

Folders and files

Latest commit

History

Repository files navigation

EpiMind - Generative AI for Epidemiology

Large Language Model(LLM) for Epidemiology with RAG (Retrieval-Augmented Generation)

Meeting Agenda

Setup the Environment

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages