Web Scraper

This project demonstrates web scraping from different kinds of webpages:

Static pages with tables
Pages with pagination
Pages with AJAX/JS Pagination
The robots.txt file of each site (where one exists) has been obeyed.
Reasonable delays have been implemented between requests so as not to overload the websites.
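
The two courtesy rules above can be sketched with the Python standard library. This is a minimal illustration, not the project's actual code: the helper names `build_robot_parser` and `polite_fetch`, the injected `fetch` callable, and the one-second default delay are all assumptions made for the example.

```python
import time
from urllib.robotparser import RobotFileParser


def build_robot_parser(robots_txt: str) -> RobotFileParser:
    """Parse the text of a robots.txt file (hypothetical helper)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def polite_fetch(urls, parser, fetch, user_agent="*", delay_seconds=1.0):
    """Fetch only the URLs that robots.txt allows, pausing after each request.

    `fetch` is any callable that takes a URL and returns the page content;
    it is injected so the sketch does not depend on a specific HTTP library.
    """
    results = {}
    for url in urls:
        if not parser.can_fetch(user_agent, url):
            continue  # disallowed by robots.txt, so skip it
        results[url] = fetch(url)
        time.sleep(delay_seconds)  # assumed fixed delay between requests
    return results
```

In real use the robots.txt text would be downloaded from the target site first; `RobotFileParser` can also do that itself through its `set_url()` and `read()` methods.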

Features

The application can scrape data from the following sites:

books.toscrape.com - A scraping sandbox that resembles an e-commerce website
scrapethissite.com - A scraping sandbox that contains a page with AJAX
worldometers.info - A statistics site that publishes data on population and other topics such as the COVID-19 pandemic
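
For paginated pages, one common approach is to keep following the "next" link until none is left. The sketch below assumes the link sits inside an `<li class="next">` element, which may not match every site; `NextLinkParser`, `iter_pages`, and the injected `fetch` callable are hypothetical names used only for this illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class NextLinkParser(HTMLParser):
    """Records the href of the first <a> found inside <li class="next">."""

    def __init__(self):
        super().__init__()
        self._in_next_item = False
        self.next_href = None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "li" and "next" in (attrs.get("class") or "").split():
            self._in_next_item = True
        elif tag == "a" and self._in_next_item and self.next_href is None:
            self.next_href = attrs.get("href")

    def handle_endtag(self, tag):
        if tag == "li":
            self._in_next_item = False


def iter_pages(start_url, fetch, max_pages=100):
    """Yield (url, html) for each page, following "next" links.

    `fetch` is any callable that takes a URL and returns HTML text.
    `max_pages` is an assumed safety cap against endless loops.
    """
    url = start_url
    for _ in range(max_pages):
        html = fetch(url)
        yield url, html
        parser = NextLinkParser()
        parser.feed(html)
        if parser.next_href is None:
            return  # no "next" link, so this was the last page
        url = urljoin(url, parser.next_href)  # resolve relative links
```

A delay between iterations would normally be added by the caller or inside `fetch`.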

The scraped data can be saved as either a CSV or an XLSX file.
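
As a rough illustration of the CSV case, scraped rows held as dictionaries can be written with the standard `csv` module. The function name `write_rows_as_csv` and the sample row are made up for this sketch; XLSX output would need a third-party library and is not shown.

```python
import csv
import io


def write_rows_as_csv(rows, fieldnames, stream):
    """Write a list of dict rows to a text stream in CSV format."""
    writer = csv.DictWriter(stream, fieldnames=fieldnames)
    writer.writeheader()
    writer.writerows(rows)


# Usage with an in-memory buffer; the row below is invented sample data.
books = [{"title": "Example Book", "price": "10.00"}]
buffer = io.StringIO()
write_rows_as_csv(books, ["title", "price"], buffer)
csv_text = buffer.getvalue()
```

When writing to a real file, open it with `newline=""` so that the `csv` module controls the line endings.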
