This scraper is part of our IE 684 Web Mining Project: "Mining Node.js Module Dependencies – Analysis of a Dependency Graph & Development of a Dependency Recommender System"

Usage:

- `$ npm install`
- Set the `YOUR_PART` variable in `src/all_npm.js` to your assigned part, or change `NUM_PARTS` to `1` to scrape all the modules in one session (not recommended).
- Set the `YOUR_API_KEY` variable in `src/github/octokit.js` to your GitHub API key.
- `$ npm start`: fetches all the modules within your assigned part, as defined in `packagesList.json`. `packagesList.json` is a list of all module names currently available on NPM and was created via npms CouchDB.
- `$ npm run merge`: merges all fetched parts.
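
To make the part-based workflow concrete, here is a minimal sketch of how a list of module names could be split into parts and later recombined. It is not taken from `src/all_npm.js`: the helpers `selectPart` and `mergeParts` are hypothetical, and it assumes that parts are numbered from 1 and that the list is cut into contiguous chunks. The actual scraper may partition differently.

```javascript
// Hypothetical helper, not the project's implementation.
// Returns the contiguous chunk of `names` that belongs to `part`
// (assumed 1-based) when the list is split into `numParts` chunks.
function selectPart(names, numParts, part) {
  if (!Number.isInteger(numParts) || numParts < 1) {
    throw new RangeError("numParts must be a positive integer");
  }
  if (!Number.isInteger(part) || part < 1 || part > numParts) {
    throw new RangeError("part must be between 1 and numParts");
  }
  const chunkSize = Math.ceil(names.length / numParts);
  const start = (part - 1) * chunkSize;
  return names.slice(start, start + chunkSize);
}

// Hypothetical helper: concatenates the per-part results,
// given in part order, back into a single list.
function mergeParts(parts) {
  return parts.flat();
}

// Stand-in for the contents of packagesList.json.
const names = ["express", "lodash", "react", "async", "chalk"];
const NUM_PARTS = 2;

const parts = [];
for (let part = 1; part <= NUM_PARTS; part++) {
  parts.push(selectPart(names, NUM_PARTS, part));
}

console.log(parts);
console.log(mergeParts(parts));
```

With `NUM_PARTS` set to `1`, `selectPart(names, 1, 1)` returns the whole list, which corresponds to scraping everything in a single session.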