AWS AI RAG

Using RAG (Retrieval-augmented generation) to provide an LLM with up-to-date news.

Try the working demonstration.

Architecture summary

Main technologies/products used:

Amazon OpenSearch Service (free tier on t2.small.search with 10GB EBS storage)
AWS Bedrock generative AI (Anthropic Claude 3 Sonnet) and vector embedding (Titan)
AWS Lambda, SQS, API Gateway, S3, EventBridge, CloudFormation, DynamoDB
FastAPI, Mangum

Data ingestion pipeline

A periodically triggered lambda function obtains the URLs of the ten "most read" news articles from the main BBC news website https://www.bbc.co.uk/news and places each URL as a message in a SQS queue.
A second lambda removes URLs from the queue and does the following steps:
- Check the OpenSearch vector database and discard the URL if it has already been processed
- Extract the news article's full text and other information such as publication date, keywords, etc.
- Form a short text chunk comprising the article title and first 3 paragraphs
- Produce a vector embedding of the text chunk using the AWS Bedrock Titan model
- Store the article data as a document in the OpenSearch database, indexed by the embedding vector
A third lambda periodically deletes old documents from the OpenSearch database.

Query process with RAG

Serve a simple static website to the user from S3
The user sends a question in a REST POST request to the API Gateway, which is routed to a lambda
Produce a vector embedding of the question using AWS Bedrock
Do a semantic similarity search on the OpenSearch vector database using the vector embedding
Combine the full text of the most relevant search results with the question to produce a RAG query
Pass the RAG query to the AWS Bedrock Titan LLM and obtain a response
Present the response to the user, together with the URLs of the source news articles selected in step 4 as citations/further reading.

Front end

github.com/e-mit/aws-api-website is used to create an API Gateway which serves the files in /static/ and proxies the API.

This also configures a custom domain name for the gateway URL and provides CAPTCHA support.

Setup notes

AWS Bedrock is available in only a subset of AWS regions (e.g. Paris, not London)
Must enable account access to the LLMs ("Foundation Models") with Bedrock before use

Name		Name	Last commit message	Last commit date
Latest commit History 140 Commits
.github		.github
deletion_lambda		deletion_lambda
fastapi_lambda		fastapi_lambda
main_scrape_lambda		main_scrape_lambda
news_scrape_lambda		news_scrape_lambda
old_examples		old_examples
query_lambda		query_lambda
static		static
tests		tests
.coveragerc		.coveragerc
.gitignore		.gitignore
.pylintrc		.pylintrc
LICENSE		LICENSE
README.md		README.md
auth_dev.sh		auth_dev.sh
create_oss_index.py		create_oss_index.py
create_test_db_table.sh		create_test_db_table.sh
fastapi_dev_test.py		fastapi_dev_test.py
requirements-test.txt		requirements-test.txt
requirements.txt		requirements.txt
run_fastapi_dev_test.sh		run_fastapi_dev_test.sh
run_tests.sh		run_tests.sh
setup.sh		setup.sh
stack.sh		stack.sh
template.yml		template.yml
upload_static.sh		upload_static.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AWS AI RAG

Architecture summary

Main technologies/products used:

Data ingestion pipeline

Query process with RAG

Front end

Setup notes

About

Releases 1

Packages

Languages

License

e-mit/aws-ai-rag

Folders and files

Latest commit

History

Repository files navigation

AWS AI RAG

Architecture summary

Main technologies/products used:

Data ingestion pipeline

Query process with RAG

Front end

Setup notes

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages