RAG Application with Optimizations on HNSW Index, Quantization, Hybrid Search and Semantic Caching 🗽

This repository contains a RAG application that uses Qdrant as the vector database, hybrid search with SPLADE embeddings, and semantic caching, together with optimization insights on configuring the HNSW index and quantization.
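To make the semantic caching idea concrete, here is a minimal, standard-library-only sketch. It is not the code from this repository: the names `SemanticCache` and `answer_with_cache`, the in-memory list used as storage, and the 0.9 similarity threshold are all assumptions chosen for illustration. In the actual application the cache would presumably live in a Qdrant collection and the vectors would come from an embedding model.

```python
import math
from typing import Callable, List, Optional, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


class SemanticCache:
    """Hypothetical in-memory cache keyed by query embeddings, not exact strings."""

    def __init__(self, threshold: float = 0.9) -> None:
        # Assumed threshold: a stored answer is reused only when similarity >= threshold.
        self.threshold = threshold
        self._entries: List[Tuple[List[float], str]] = []

    def lookup(self, query_vector: Sequence[float]) -> Optional[str]:
        """Return the answer of the most similar cached query, or None on a miss."""
        best_score = 0.0
        best_answer: Optional[str] = None
        for vector, answer in self._entries:
            score = cosine_similarity(query_vector, vector)
            if score > best_score:
                best_score, best_answer = score, answer
        return best_answer if best_score >= self.threshold else None

    def store(self, query_vector: Sequence[float], answer: str) -> None:
        """Remember an answer under the embedding of the query that produced it."""
        self._entries.append((list(query_vector), answer))


def answer_with_cache(
    query_vector: Sequence[float],
    cache: SemanticCache,
    generate: Callable[[], str],
) -> str:
    """Serve from the cache when a similar query was seen; otherwise generate and store."""
    cached = cache.lookup(query_vector)
    if cached is not None:
        return cached
    answer = generate()  # stands in for the retrieval + LLM call
    cache.store(query_vector, answer)
    return answer
```

With this sketch, two queries whose embeddings point in nearly the same direction share one generated answer, so the expensive retrieval and generation step runs only once for them. The threshold trades hit rate against the risk of returning an answer to a question that is merely similar.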

For detailed project descriptions, refer to these Medium blogs:

Tech Stack

Feel free to ⭐ and clone this repo 😉

Folders and files

data
.gitignore
README.md
batch_insert_data_collection_sparse.py
create_collection_sparse.py
embeddings.py
extract_ingredients.py
extract_words.py
generate_response.py
requirements.txt
sample_splade_embedding.py
semantic_search_caching.py
update_collections.py


benitomartin/semantic-caching-qdrant-splade
