Amazon bestsellers scraper for Books

To scrape details such as book name, price, author, rating, number of reviews and rank.

This gets the top 100 bestsellers books from the main page and the top 100 from each genre page.

Make a request to each URL using selenium webdriver and grab the source code.
Parse the HTML through the LXML parser in BeautifulSoup.
Then identify the data points using CSS selectors and save the data as a dataframe.

Google Chrome driver - download from the site according to browser settings. (download here)

The python packages:

bs4 : BeautifulSoup package for converting HTML code into a soup object to gather data.
pandas : To store and manipulate data using dataframes.
time : To introduce wait functionality in the code while the page is loading.
re : python regex package to check for patterns.
selenium - To use chrome driver to make a request to the URL and capture the HTML source code.

Install package requirements via command line: pip install -r requirements.txt

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
Amazon_Bestseller_books.csv		Amazon_Bestseller_books.csv
README.md		README.md
amz_books_bestseller_scraper.py		amz_books_bestseller_scraper.py
requirements.txt		requirements.txt

Provide feedback