National University of Singapore (NUS) CORS Scrapper

This project scrapes through the CORS website for module information, including lecture and tutorial timings and then stores it into a MySQL database.

The implementation was initially based on another project by shadowsun7 (https://github.com/shadowsun7/cors-api), however only the scrapper was used. Hence it is only fitting that this is listed as a separate project.

Dependencies

You will need:

Python interpreter
Scrapy
MySQLdb-python

and optionally, but recommended: VirtualEnv, which will create an isolated environment to install your dependencies.

Configuration

You will need to give cors/settings.py your MySQL database details so that scrapy will know where to dump them.

How to scrape

Simply change to the cors-scraper directory and run:

scrapy crawl cors

The CORS spider will start to crawl through all the modules in the CORS website and dump the information into the database.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
cors		cors
.gitignore		.gitignore
README.markdown		README.markdown
scrapy.cfg		scrapy.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

National University of Singapore (NUS) CORS Scrapper

Dependencies

Configuration

How to scrape

About

Releases

Packages

Languages

crainiarc/cors-scraper

Folders and files

Latest commit

History

Repository files navigation

National University of Singapore (NUS) CORS Scrapper

Dependencies

Configuration

How to scrape

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages