ScrapePlayStoreReviews

APIs that extract reviews from the Google Play Store don't work beyond page 111. This scraper is intended to scrape ALL reviews from an app. It works by automating the human action of scrolling through Play Store reviews (via Selenium) and saving each review visible on the page, roughly as sketched below.
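The following is a minimal sketch of that scroll-and-save idea, not the exact code in scraper.py; the chromedriver path, the Play Store URL, and the CSS selector are placeholder assumptions you would adapt.

    # Minimal sketch of the scroll-and-save approach (not the exact code in scraper.py).
    # The chromedriver path and the CSS selector are illustrative placeholders.
    import time
    from selenium import webdriver

    chromedriver = '/path/to/chromedriver'  # point this at your ChromeDriver binary
    driver = webdriver.Chrome(chromedriver)
    driver.get('https://play.google.com/store/apps/details?id=APP_ID&showAllReviews=true')

    seen = set()
    while True:
        # scroll to the bottom so the page lazy-loads the next batch of reviews
        driver.execute_script('window.scrollTo(0, document.body.scrollHeight);')
        time.sleep(2)  # give the newly loaded reviews time to render
        elements = driver.find_elements_by_css_selector('div.review-text')  # placeholder selector
        new = [e.text for e in elements if e.text not in seen]
        if not new:
            break  # no new reviews appeared; assume the end of the list
        seen.update(new)
    print('collected %d reviews' % len(seen))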

A product of ❤ from 3B, for the love of people and, of course, the internet and the digital era.

A ⭐ would make my day if you find the script useful.

Installations Required (more to be added)

  • pip3 install selenium
  • pip3 install matplotlib
  • pip3 install requests-testadapter
  • pip3 install lxml

Prerequisites

  • Assumes Python 3.5+ and pip3.
  • Download ChromeDriver from http://chromedriver.chromium.org/ and point the chromedriver variable in scraper.py at the downloaded binary.
  • JSON files with individual elements (reviews only, dates only) are saved in a directory called data_folder. If you want these files, create a directory named 'data_folder' in the base directory and run the code as is; if not, comment out all the lines that save data to 'data_folder'. (See the sketch after this list for both points.)
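As a rough illustration of the last two points: the variable name chromedriver comes from this README, but the file names under data_folder are assumptions, not necessarily what scraper.py writes.

    # Sketch of the prerequisite wiring; file names under data_folder are assumptions.
    import json
    from selenium import webdriver

    chromedriver = '/usr/local/bin/chromedriver'  # point this at your ChromeDriver binary
    driver = webdriver.Chrome(chromedriver)

    reviews = ['Great app!', 'Crashes on startup.']  # placeholder for scraped review texts
    dates = ['May 1, 2018', 'May 3, 2018']           # placeholder for scraped review dates

    # These writes fail unless ./data_folder exists; create it first,
    # or comment out lines like these if you do not want the per-element files.
    with open('data_folder/reviews.json', 'w') as f:
        json.dump(reviews, f)
    with open('data_folder/dates.json', 'w') as f:
        json.dump(dates, f)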

Running

  • By default the browser runs headless, so no window appears and progress is shown only in the terminal. If you want the clutter, comment out the line options.add_argument('headless') and the browser window will open on your desktop (see the sketch after this list).
  • scraper.py is the primary scraper. If you have a stable Internet connection with minimal interruptions, it is the only script you need to run.
  • Enter the APP_ID and appname in scraper.py and execute the script. The url and session_id will be printed to the console. Copy these values into scraper_open_browser.py (see the sketch after this list).
  • The scraper should run and save data in the base folder.
  • (Yet to be implemented) If the code is interrupted by an error (usually a connectivity loss), change the counter (k) in scraper_open_browser.py to the last review the scraper saved. Ensure the driver_id and url are those of the window created by scraper.py, then run scraper_open_browser.py.
  • (Yet to be implemented) Every time the code is interrupted by an error, repeat the previous step (set 'k' to the last review saved by scraper.py or scraper_open_browser.py).
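The headless toggle and the session hand-off between the two scripts look roughly like the sketch below. Reattaching by session_id uses a common Selenium trick; whether scraper_open_browser.py does exactly this is an assumption, and the url/session_id values are placeholders for what scraper.py prints.

    # --- in scraper.py (sketch) ---
    from selenium import webdriver

    options = webdriver.ChromeOptions()
    options.add_argument('headless')  # comment this out to watch the browser window
    driver = webdriver.Chrome('/path/to/chromedriver', options=options)

    print(driver.command_executor._url)  # the url to copy into scraper_open_browser.py
    print(driver.session_id)             # the session_id to copy into scraper_open_browser.py

    # --- in scraper_open_browser.py (sketch) ---
    # Reattach to the browser scraper.py opened instead of starting a new one.
    url = 'http://127.0.0.1:PORT'  # paste the url printed by scraper.py
    session_id = 'SESSION_ID'      # paste the session_id printed by scraper.py

    driver = webdriver.Remote(command_executor=url, desired_capabilities={})
    driver.close()                  # discard the throwaway session Remote just created
    driver.session_id = session_id  # take over the session scraper.py opened

    k = 0  # (yet to be implemented) set to the last review index saved before the interruption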

TO-DO

  • scraper
      • add an interrupt for a clean stop after the current iteration
      • add restart from the last stop
      • add multi-threading if possible
