Skip to content

Latest commit

 

History

History
107 lines (69 loc) · 2.54 KB

README.rst

File metadata and controls

107 lines (69 loc) · 2.54 KB

Book Scraper

Web scraping icon. Python programming language logo. Aiohttp icon.

Table of Contents

Introduction

Book Scraper is a python application that scrapes the book catalogue website, http://books.toscrape.com. It then allows you to look at the data related to books on the different pages of the website.

The purpose of this application was to become more familiar with web scraping.

Features

  • Scrapes the web site: http://books.toscrape.com
  • Asynchronous requests
  • Console menu with the options:
    • Look at highest rated books
    • Look at cheapest books
    • Get next available book in catalogue
    • Exit menu
  • Console menu also has a progress bar

Screenshots

Screenshot of Book Catalogue Website

Screenshot of Console Menu

Launch

To run, in the console enter:

python app.py

Technologies

Python Modules

  • BeautifulSoup4
    • Allows a web page to be scraped (parses its HTML or XML)
  • requests
    • Allows sending HTTP requests
  • logging
    • Allows for logging information.
  • asyncio
    • For using event loops.
  • aiohttp
    • For creating asynchronous HTTP Client/Server connections with asyncio.
  • async_timeout
    • Asyncio-compatible timeout context manager.
      • Timeouts requests that take too long.
  • time
    • For timing code execution time.