
Script for scraping the Internet Speculative Fiction Database and building a large dataset of science fiction and fantasy book metadata.


Capybasilisk/SFF-Scraper


SFF-Scraper

This code builds a large CSV dataset of science fiction and fantasy book metadata by scraping the Internet Speculative Fiction Database.

The metadata consists of book title, author, publication date, and type. Type can be novel, short story, anthology, omnibus, etc.
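A minimal sketch of how one row of this metadata could be represented and written out as CSV. The class name `BookRecord`, the helper `write_records`, and the column names are assumptions made for illustration; the script's actual header and internals may differ, and the sample rows are placeholders rather than scraped data.

```python
import csv
import io
from dataclasses import astuple, dataclass, fields


@dataclass
class BookRecord:
    # Hypothetical field names; the real CSV header may differ.
    title: str
    author: str
    publication_date: str
    type: str  # e.g. "novel", "short story", "anthology", "omnibus"


def write_records(records, out):
    """Write a header row followed by one CSV row per record."""
    writer = csv.writer(out)
    writer.writerow([f.name for f in fields(BookRecord)])
    for record in records:
        writer.writerow(astuple(record))


# Placeholder rows standing in for scraped entries.
sample = [
    BookRecord("Example Novel", "A. Author", "1999-01-01", "novel"),
    BookRecord("Example Stories", "B. Editor", "2005-06-01", "anthology"),
]

buffer = io.StringIO()
write_records(sample, buffer)
csv_text = buffer.getvalue()
print(csv_text)
```

Writing to a file-like object keeps the sketch independent of any particular output path; a real run would pass a file opened with `newline=""` instead of the in-memory buffer.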

Once the code has finished running, the generated dataset contains over 120,000 rows.

Running time is several hours on a basic Linux server.

The completed dataset is available on Kaggle:

https://www.kaggle.com/capybasilisk/science-fiction-and-fantasy-book-metadata
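Once the CSV is available, whether generated locally or downloaded, a quick sanity check is to count the rows and tally them by type. The sketch below assumes a header row with a column named `type`; that column name and the helper `count_by_type` are hypothetical, so adjust them to match the actual file. It runs against a small in-memory stand-in rather than the real dataset.

```python
import csv
import io
from collections import Counter


def count_by_type(csv_file, type_column="type"):
    """Return (row_count, Counter of the values found in type_column)."""
    reader = csv.DictReader(csv_file)
    counts = Counter()
    total = 0
    for row in reader:
        total += 1
        counts[row[type_column]] += 1
    return total, counts


# Tiny in-memory stand-in for the dataset; all rows are placeholders.
demo = io.StringIO(
    "title,author,publication_date,type\n"
    "Example Novel,A. Author,1999-01-01,novel\n"
    "Example Stories,B. Editor,2005-06-01,anthology\n"
    "Another Novel,C. Writer,2010-03-15,novel\n"
)

total, counts = count_by_type(demo)
print(total, dict(counts))
```

For the real file, pass a handle opened with `open(path, newline="", encoding="utf-8")` in place of `demo`, where `path` is wherever the CSV was saved.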
