Facebook-Scraper

Author - Himanshu Maheshwari

What does it do?

This script scrapes the text of posts on a public Facebook page, together with all comments on those posts and all replies to those comments, and saves them in a MongoDB database. It also stores a link to the profile of each person who made a comment. It uses Selenium and does not use Facebook's Graph API.

Requirements

Python 3.x
The latest Chrome WebDriver, placed in the same folder as the script
MongoDB and Selenium installed

Note: This script runs on a Linux machine. With slight modifications it could also run on a Windows machine.
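The requirements above can be checked before the first run. The following is only a minimal sketch, not part of the script: the function name missing_prerequisites is hypothetical, the driver file name chromedriver is taken from the file shipped with the repository, and pymongo is an assumption, since the README names MongoDB but not the Python package used to talk to it.

```python
import importlib.util
from pathlib import Path


def missing_prerequisites(script_dir, driver_name="chromedriver",
                          modules=("selenium", "pymongo")):
    """Return a list of problems found; an empty list means every check passed.

    driver_name and the pymongo entry in modules are assumptions made for
    illustration, not values confirmed by the script.
    """
    problems = []
    # The README asks for the driver to sit in the same folder as the script.
    if not (Path(script_dir) / driver_name).is_file():
        problems.append(f"{driver_name} not found in {script_dir}")
    # find_spec returns None when a top-level module cannot be located.
    for name in modules:
        if importlib.util.find_spec(name) is None:
            problems.append(f"Python package '{name}' is not installed")
    return problems
```

Calling missing_prerequisites(".") from the script's folder and printing each returned line gives a quick report. The check does not verify that a MongoDB server is actually running.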

How to use?

Open the facebook_scraper.py file in any text editor.
1. Assign the name of the page to the page_name variable (line 31).
2. Assign the URL of the page to the url variable (line 32).
3. Assign the number of times you want to scroll to the total_scrolls variable (line 35). The value should be a non-negative integer. The larger the value, the further the page is scrolled.
4. Assign your Facebook email address to the email variable (line 150).
5. Assign your password to the password variable (line 151).
6. Save the changes.
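The values assigned in steps 1 to 5 can be sanity-checked before running. This is a hedged sketch rather than code from the script: validate_settings is a hypothetical helper, and the only rule taken from the README is that total_scrolls must be a non-negative integer. The other checks are assumptions about what sensible values look like.

```python
from urllib.parse import urlparse


def validate_settings(page_name, url, total_scrolls, email, password):
    """Raise ValueError describing the first setting that looks wrong."""
    if not page_name:
        raise ValueError("page_name must not be empty")
    # Assumption: url should be a complete http(s) address.
    parts = urlparse(url)
    if parts.scheme not in ("http", "https") or not parts.netloc:
        raise ValueError("url must be a full http(s) URL")
    # bool is a subclass of int in Python, so it is excluded explicitly.
    if (isinstance(total_scrolls, bool)
            or not isinstance(total_scrolls, int)
            or total_scrolls < 0):
        raise ValueError("total_scrolls must be a non-negative integer")
    if not email or not password:
        raise ValueError("email and password must both be set")
```

For example, validate_settings("ExamplePage", "https://www.facebook.com/ExamplePage/", 5, "me@example.com", "secret") returns without error, where every argument is a placeholder value, while passing -1 or 2.5 as total_scrolls raises ValueError.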
Next, put the latest Chrome WebDriver in the same folder as the script. I am attaching a WebDriver that was the latest at the time of writing this script.
Open a terminal and cd into the directory containing the script. Then run the script with python3 facebook_scraper.py and wait for it to complete its work.
Suggestion - It is advisable to use a VPN when scraping, as Facebook might block your IP address if you crawl the data. That said, this script works fine even without a VPN.
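For intuition about what total_scrolls controls, a scroll loop with Selenium is commonly written along the following lines. This is a sketch under assumptions, not the script's actual code: scroll_page and pause_seconds are hypothetical names, and the JavaScript snippet is one usual way to jump to the bottom of a page. The only Selenium call used is the WebDriver method execute_script.

```python
import time


def scroll_page(driver, total_scrolls, pause_seconds=2.0):
    """Scroll to the bottom of the page total_scrolls times.

    driver is expected to be a Selenium WebDriver instance (or any object
    with an execute_script method). pause_seconds is an assumed delay that
    gives the page time to load more posts.
    """
    for _ in range(total_scrolls):
        # Run JavaScript in the page to jump to the current bottom.
        driver.execute_script("window.scrollTo(0, document.body.scrollHeight);")
        # Wait before the next scroll so newly loaded content can appear.
        time.sleep(pause_seconds)
```

With total_scrolls set to 0 the loop body never runs, which matches the README's rule that the value may be any non-negative integer.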

Output

The output of this script is the text of all posts on a public Facebook page, together with all comments on those posts and all replies to those comments, stored in the facebook MongoDB database. With slight changes you could store them in a text or CSV file instead.
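Writing the scraped records to a CSV file could look roughly like the sketch below. The record layout is an assumption: the field names post_text, comment_text and profile_link are hypothetical and chosen only to mirror what the README says is collected. The script's real field names may differ.

```python
import csv

# Hypothetical field names; adjust them to match the script's records.
FIELDS = ["post_text", "comment_text", "profile_link"]


def write_comments_csv(rows, path):
    """Write an iterable of dicts keyed by FIELDS to a CSV file with a header row."""
    # newline="" lets the csv module manage line endings, as its docs recommend.
    with open(path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.DictWriter(handle, fieldnames=FIELDS)
        writer.writeheader()
        writer.writerows(rows)
```

The csv module quotes commas and line breaks inside post or comment text, so multi-line comments survive a round trip through csv.DictReader.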

Warning

Scraping data from Facebook without permission from Facebook or without using its Graph API is illegal. Kindly refer to https://www.facebook.com/robots.txt for more details.

Cheers!!!
