Name		Name	Last commit message	Last commit date
Latest commit History 75 Commits
data		data
models		models
modules		modules
preprocess		preprocess
static		static
LICENSE		LICENSE
README.md		README.md
data.csv		data.csv
mynotebook.ipynb		mynotebook.ipynb
requirements.txt		requirements.txt
scrape.py		scrape.py
scrape_cli.py		scrape_cli.py

Repository files navigation

Resume_scrapper

A CLI tool specifically designed to scrape this webpage: https://www.jobspider.com/job/

SETUP ENVIRONMENT

Make sure you have virtualenv package installed on your local machine and the cloned repository is your current directory. If not:

pip install virtualenv

Create environment with environment name(here venv):

virtualenv venv

Activate the environment with:

source venv/bin/activate

To install the dependencies for this project, use the requirements.txt file:

pip install -r requirements.txt

Using the command

Again, make sure you are inside the repository. To use the command and get help, type:

$ python3 scrape_cli.py -h

There are two command options: --category <number> and --domain <keyword>:
1. For scrapping the Resumes by category, go to this url: https://www.jobspider.com/job/browse-resumes.asp and select the category:
  
  Check the category number in the url-
  
  Go back to your command line and enter the category number argument (here in the image, 16)
```
$ python3 scrape_cli.py -c 16
```
2. For scrapping by search keyword, go to this url: https://www.jobspider.com/job/resume-search.asp
  
  Enter the same keyword used in the search box in the command line argument like this
```
$ python3 scrape_cli.py -d developer
```

If done correctly you'll get a final web scrapped summary of resumes!

Feel free to contribute to this tool for more advanced searches or reducing the search time

About

No description, website, or topics provided.

Apache-2.0 license

Report repository

Releases

No releases published

Packages

No packages published

Languages