Boosting the Performance of Large Language Models for Question Answering with Knowledge Graph Integration

Praktikum Information Service Engineering (Master) Task 2 members: Haoran Yang, Mingze Li, Zhaotai Liu.

Supervisors: Mirza Mohtashim Alam, Ebrahim Norouzi, Genet Asefa Gesese.

There are three files: RAG_GPT3_5, transfer_PDF_to_str, and main. The main file integrates questions, queries, and a string converted from a PDF by transfer_PDF_to_str, and feeds them into the RAG model based on GPT-3.5, which is implemented in RAG_GPT3_5.

RAG before interim report has uploaded: RAG_GPT3_5.py

This code is mainly used to extract relevant information from large amounts of text data to answer a series of questions. First, it uses regular expressions and sorting logic to split the raw data into smaller chunks of text. Then, a SentenceTransformer is utilized to generate embedding representations of these text blocks, and the faiss library is used to create an index for fast similarity searches. Next, the program retrieves the text blocks most relevant to the query given the query and the created knowledge base. Finally, these relevant chunks of text are used as context for the OpenAI GPT-3.5 model to generate answers to a series of questions. The code also includes the necessary library imports and configurations, as well as the definition of some functions for processing text blocks, creating knowledge bases, retrieving related text blocks, and generating answers using the GPT-3.5 model.

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
.gitattributes		.gitattributes
0entities_list_update.xlsx		0entities_list_update.xlsx
0relationships_list_update.xlsx		0relationships_list_update.xlsx
10X10Named_Entity_Recognition_and_Relationship_Identification_.ipynb		10X10Named_Entity_Recognition_and_Relationship_Identification_.ipynb
1QustionEntityandRelationship.xlsx		1QustionEntityandRelationship.xlsx
2SimilarEntities10.xlsx		2SimilarEntities10.xlsx
2SimilarEntities10_up.xlsx		2SimilarEntities10_up.xlsx
2SimilarEntities5.xlsx		2SimilarEntities5.xlsx
2SimilarEntities5_up.xlsx		2SimilarEntities5_up.xlsx
3SimilarRelationships5X10.xlsx		3SimilarRelationships5X10.xlsx
4.beforeSparql.xlsx		4.beforeSparql.xlsx
4beforeSparq10X10.xlsx		4beforeSparq10X10.xlsx
4beforeSparq5X10.xlsx		4beforeSparq5X10.xlsx
5X10Named_Entity_Recognition_and_Relationship_Identification_.ipynb		5X10Named_Entity_Recognition_and_Relationship_Identification_.ipynb
CHANGELOG		CHANGELOG
CSV_Reader.ipynb		CSV_Reader.ipynb
Named_Entity_Recognition_and_Relationship_Identification_.ipynb		Named_Entity_Recognition_and_Relationship_Identification_.ipynb
RAG and OpenAI’s Function-Calling for Question-Answering with Langchain.py		RAG and OpenAI’s Function-Calling for Question-Answering with Langchain.py
RAG.ipynb		RAG.ipynb
RAG_GPT3_5.py		RAG_GPT3_5.py
README.md		README.md
entities.csv		entities.csv
extract_all_entities.py		extract_all_entities.py
extract_ttl_data.py		extract_ttl_data.py
main.py		main.py
output.ttl		output.ttl
relations.csv		relations.csv
transfer_PDF_to_str.py		transfer_PDF_to_str.py
triples.csv		triples.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Boosting the Performance of Large Language Models for Question Answering with Knowledge Graph Integration

About

Releases

Packages

Contributors 3

Languages

Mingze101/Boosting-the-Performance-of-LLM-for-QA-with-KGI

Folders and files

Latest commit

History

Repository files navigation

Boosting the Performance of Large Language Models for Question Answering with Knowledge Graph Integration

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages