Scrapy

Scrapy to scrape/crawl website and get data to store and analyze

Command for run

Setup virtual environment with – pyhton3 -e venv nameofenv , a folder is created with name in root directory example (bot)

Activate virtual environment with – source /virtual_environment_folder/bin/activate

After activation, install all required packages using python install -r requirements.txt

And , now under subdirectory demo, run – scrapy crawl me

Running with Custom URLs:

After the project is set setup create .env file at root level.

Following variable are set now:

site - for website url

This .env file is served as environment variable to specify urls you want to crawl

Command to run:

Setup virtual environment with – pyhton3 -e venv nameofenv , a folder is created with name in root directory example (bot)
Activate virtual environment with – source /virtual_environment_folder/bin/activate
After activation, install all required packages using python install -r requirements.txt
And , now under subdirectory demo, run – scrapy crawl me

Running with Docker images:

Run docker build -t <image_name> .
docker run crawler:custom crawl me

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
.vscode		.vscode
demo		demo
.gitignore		.gitignore
.travis.yml		.travis.yml
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Scrapy

Command for run

Running with Custom URLs:

Command to run:

Running with Docker images:

About

Releases

Packages

Contributors 2

Languages

License

sachin-s-joshi/Scrapy

Folders and files

Latest commit

History

Repository files navigation

Scrapy

Command for run

Running with Custom URLs:

Command to run:

Running with Docker images:

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages