ScrapFly Scrapers 🕷️

This repository contains educational example scrapers for popular web scraping targets using the ScrapFly web scraping API and Python.
Most scrapers use a simple web scraping stack:

Python version 3.10+
Scrapfly's Python SDK for sending HTTP requests, bypassing blocking, and parsing HTML with the built-in parsel selector.
asyncio for writing concurrent code using the async/await syntax.
JMESPath and nested-lookup for JSON parsing when needed.
loguru for logging.
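
As a rough illustration of the asyncio part of this stack, the sketch below schedules several page requests concurrently and collects the results in input order. `fetch_page` is a hypothetical stand-in that only simulates latency; it is not part of Scrapfly's SDK, and in the repository's scrapers this step would go through the SDK instead. The URLs are placeholders.

```python
import asyncio


async def fetch_page(url: str) -> dict:
    """Hypothetical stand-in for a real request (assumption, not the SDK)."""
    await asyncio.sleep(0.01)  # simulate network latency
    return {"url": url, "status": 200}


async def scrape_all(urls: list[str]) -> list[dict]:
    # Schedule every request at once; gather returns results in input order.
    return await asyncio.gather(*(fetch_page(url) for url in urls))


if __name__ == "__main__":
    pages = asyncio.run(
        scrape_all(["https://example.com/1", "https://example.com/2"])
    )
    for page in pages:
        print(page["url"], page["status"])
```

Because the coroutines wait on I/O at the same time, the total run time stays close to that of the slowest single request rather than the sum of all of them.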

To learn more about web scraping, see our full tutorials on how to scrape these targets (and many others) in the scrapeguide directory.

List of Scrapers

Below is the list of available web scrapers for the supported domains along with their scrape guide, sample datasets, and status. 👇

Domain	Guide	Sample Datasets	Status
Aliexpress.com	How to Scrape Aliexpress.com (2024 Update)	Product pages Search pages Product reviews
Amazon.com	How to Scrape Amazon.com Product Data and Reviews	Product pages Search pages Product reviews
BestBuy.com	How to Scrape BestBuy Product, Offer and Review Data	Sitemap pages Product pages Review pages Search pages
Bing.com	How to Scrape Bing Search with Python	SERP data Keyword data Rich snippet data
Booking.com	How to Scrape Booking.com (2023 Update)	Hotel pages Search pages
Crunchbase.com	How to Scrape Crunchbase in 2024	Company pages Investor pages
Domain.com.au	How to Scrape Domain.com.au Real Estate Property Data	Property pages Search pages
Ebay.com	How to Scrape Ebay Using Python (2024 Update)	Product pages Product pages with variant Search pages
Etsy.com	How to Scrape Etsy.com Product, Shop and Search Data	Product pages Shop pages Search pages
Fashionphile.com	How to Scrape Fashionphile for Second Hand Fashion Data	Product pages Search pages
Glassdoor.com	How to Scrape Glassdoor (2024 update)	Job pages Review pages Salary pages
Goat.com	How to Scrape Goat.com for Fashion Apparel Data in Python	Product pages Search pages
Google.com	How to Scrape Google Search Results - How to Scrape Google Maps	SERP data Keyword data Google Maps place URLs Google Maps place data
Homegate.ch	How to Scrape Homegate.ch Real Estate Property Data	Property pages Search pages
Idealista.com	How to Scrape Idealista.com in Python - Real Estate Property Data	Property pages Search pages Provinces pages
Immobilienscout24.de	How to Scrape Immobilienscout24.de Real Estate Data	Property pages Search pages
Immoscout24.ch	How to Scrape Immoscout24.ch Real Estate Property Data	Property pages Search pages
Immowelt.de	How to Scrape Immowelt.de Real Estate Data	Property pages Search pages
Indeed.com	How to Scrape Indeed.com (2024 Update)	Job pages Search pages
Instagram.com	How to Scrape Instagram in 2024	User data All user posts Multi image post Video Post
Leboncoin.fr	How to Web Scrape Leboncoin.fr using Python	Ad pages Search pages
Nordstrom.com	How to Scrape Nordstrom Fashion Product Data	Product pages Search pages
Realestate.com.au	How to Scrape Realestate.com.au Property Listing Data	Property pages Search pages
Realtor.com	How to Scrape Realtor.com - Real Estate Property Data	Property pages Search pages Feed pages
Reddit.com	How to Scrape Reddit Posts, Subreddits and Profiles	Post pages Subreddit pages User comment pages User post pages
Redfin.com	How to Scrape Redfin Real Estate Property Data in Python	Property pages for sale Property pages for rent Search pages
Rightmove.com	How to Scrape RightMove Real Estate Property Data with Python	Property pages Search pages
Seloger.com	How to Scrape Seloger.com - Real Estate Listing Data	Property pages Search pages
Similarweb.com	How to Scrape SimilarWeb Website Traffic Analytics	Website pages Website compare pages Trend pages Sitemaps
Stockx.com	How to Scrape StockX e-commerce Data with Python	Product pages Search pages
Threads.net	How to scrape Threads by Meta using Python (2024 Update)	Profile pages Thread pages
TikTok.com	How To Scrape TikTok in 2024	Comment data Post data Profile data Channel data Search data
Tripadvisor.com	How to Scrape TripAdvisor.com (2024 Updated)	Hotel pages Search pages Location pages
Trustpilot.com	How to Scrape Trustpilot.com Reviews and Company Data	Company pages Reviews pages Search pages
Twitter(X).com	How to Scrape X.com (Twitter) using Python (2024 Update)	Profile pages Tweet pages
VestiaireCollective.com	How to Scrape Vestiaire Collective for Fashion Product Data	Product pages Search pages
G2.com	How to Scrape G2 Company Data and Reviews	Review pages Search pages Alternatives pages
Walmart.com	How to Scrape Walmart.com Product Data (2024 Update)	Product pages Search pages
Wellfound.com	How to Scrape Wellfound Company Data and Job Listings	Company pages Search pages
Linkedin.com	How to Scrape LinkedIn in 2024	Profile pages Company pages Job search pages Job pages
Yellowpages.com	How to Scrape YellowPages.com Business Data and Reviews (2024 Update)	Business pages Search pages
Yelp.com	How to Web Scrape Yelp.com (2024 update)	Business pages Review pages Search pages
YouTube.com		Channel videos Channel metadata Channel videos Video metadata Video comments Shorts' metadata
Zillow.com	How to Scrape Zillow Real Estate Property Data in Python	Property pages Search pages
Zoominfo.com	How to Scrape Zoominfo Company Data (2024 Update)	Company pages Directory pages FAQs data
Zoopla.co.uk	How to Scrape Zoopla Real Estate Property Data in Python	Property pages Search pages

Fair Use and Legal Disclaimer

This repository contains educational reference material that illustrates how accessible web scraping can be; the provided programs are not intended for use in production web scraping. That being said, the Scrapfly team is constantly updating and improving all of this code for an optimal experience.

Scrapfly does not offer legal advice. As always, consult a lawyer when creating programs that interact with other people's websites. That said, here is a good general introduction to what NOT to do:

Do not store PII (personally identifiable information) of EU citizens who are protected by GDPR.
Do not scrape and repurpose entire public datasets which can be protected by database protection laws in some countries.
Do not scrape at rates that could damage the website and scrape only publicly available data.
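
One way to keep request rates modest is to cap the number of in-flight requests and pause briefly before each slot is reused. The sketch below does this with `asyncio.Semaphore`. The function names, the limit of 2, and the delay value are illustrative assumptions rather than recommendations, and `fetch_page` is again a hypothetical stand-in for a real request.

```python
import asyncio


async def fetch_page(url: str) -> dict:
    """Hypothetical stand-in for a real request (assumption)."""
    await asyncio.sleep(0.01)  # simulate network latency
    return {"url": url, "status": 200}


async def polite_scrape(urls, max_concurrent=2, delay=0.01):
    """Fetch urls with at most max_concurrent requests in flight."""
    sem = asyncio.Semaphore(max_concurrent)
    stats = {"in_flight": 0, "peak": 0}  # tracked only to show the cap holds

    async def worker(url):
        async with sem:  # blocks while max_concurrent workers hold a slot
            stats["in_flight"] += 1
            stats["peak"] = max(stats["peak"], stats["in_flight"])
            try:
                return await fetch_page(url)
            finally:
                await asyncio.sleep(delay)  # pause before freeing the slot
                stats["in_flight"] -= 1

    results = await asyncio.gather(*(worker(url) for url in urls))
    return results, stats["peak"]


if __name__ == "__main__":
    urls = [f"https://example.com/{i}" for i in range(6)]
    results, peak = asyncio.run(polite_scrape(urls))
    print(len(results), "pages fetched; peak concurrency:", peak)
```

What counts as a safe rate depends on the target website, so the limit and delay are left as parameters.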

