Skip to content

timkmecl/studentska-prehrana-scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Analysis of Changes in Student Food Prices 2022-23

Welcome to this repository, which provides a script to scrape data from Študentska prehrana, a website that lists restaurants that offer discounted meals to students in Slovenia. The script retrieves data from the Internet Archive to compare the prices of meals in June 2022 with the current prices listed on the website. The data is cleaned and merged into a single dataframe for further analysis.

In addition to the script, this repository also includes a Jupyter notebook overview.ipynb that calculates various statistics on the data, and histograms.ipynb that plots histograms of the price changes (see them inside folder plots).

I use the data as a basis for Boni 23 website (repo here)

Data

The repository contains the following data files:

Usage

You can import the functions from the scraper module and use them in your own scripts as follows:

from scraper import load_data, merge_data

df_new, df_old = load_data()
df = merge_data(df_new, df_old)

Alternatively, you can run the script directly:

python scraper.py

This will save the data to both a CSV and a JSON file in the /data directory.

Plots

Histogram of prices then and now

Histogram of price changes

Histogram of price increases

Histogram of price increases

About

Web Scraper and analysis of changes in Student Food Prices 2022-23

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published