Skip to content

saither0204/simple_information_retrival_system

Repository files navigation

Project related to Course Subject: Information Retrieval.

Making this Simple Infomation Retrival System for aesopa10.txt file.

Simple Information Retrieval System

Project Description

This project is a simple information retrieval system that allows users to search for documents based on keywords. The system uses a vector space model to represent documents as vectors and calculate the cosine similarity between the query vector and document vectors. The system also uses a simple inverted index to speed up the search process and simple linear search model as control.

Table of Contents

Installation

This project does not demand any additional libraries in python. The only thing you need to do is to clone the repository and run the simple_information_retrival.py file as the usage specifies the command line arguments.

Usage

The system can be run from the command line using the following command to list a few:

- python simple_information_retrival.py --extract-collection aesopa10.txt
- python simple_information_retrival.py --query "somesearchterm" --model "vector/bool" --search-mode "inverted/linear" --documents "original/no_stopwords" --stemming
- python simple_information_retrival.py --query "somesearchterm" --model "vector/bool" --search-mode "inverted/linear" --documents "original/no_stopwords"
python simple_information_retrival.py --query "somesearchterm" --model "vector" --documents "original/no_stopwords"

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages