Scrapes Spotify data from kworb.net. Uploads as parquet files to Azure Blob Storage

DataGoblin/spotify-kworb-data-to-azure

Spotify Data to Azure Blob Storage

Scrapes Spotify data from kworb.net and uploads it as Parquet files to Azure Blob Storage.

Kworb presents all of its data on static webpages, which makes it incredibly easy to scrape and puts practically no stress on their servers!

The repo includes three functions, each of which scrapes a specific page.
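To illustrate why a static page is so easy to scrape, here is a minimal sketch that pulls the rows out of a plain HTML table using only the Python standard library. It is not the code from this repo: `TableParser`, `parse_table`, `scrape_page` and `SAMPLE_HTML` are hypothetical names, and the sample table is made-up placeholder data rather than real kworb.net output.

```python
from html.parser import HTMLParser
from urllib.request import urlopen


class TableParser(HTMLParser):
    """Collects the text of every <th>/<td> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []     # finished rows, each a list of cell strings
        self._row = None   # row currently being built, or None
        self._cell = None  # text fragments of the current cell, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Only keep text that sits inside a table cell.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            if self._row:
                self.rows.append(self._row)
            self._row = None
            self._cell = None


def parse_table(html):
    """Return the table rows as dicts keyed by the header row."""
    parser = TableParser()
    parser.feed(html)
    parser.close()
    if not parser.rows:
        return []
    header, *body = parser.rows
    return [dict(zip(header, row)) for row in body]


def scrape_page(url):
    """Download one static page and parse its table (hypothetical helper)."""
    with urlopen(url) as response:
        html = response.read().decode("utf-8", errors="replace")
    return parse_table(html)


# Made-up placeholder markup, used only to show the parsing step offline.
SAMPLE_HTML = """
<table>
<tr><th>Artist</th><th>Streams</th></tr>
<tr><td><a href="/x">Artist A</a></td><td>1,234</td></tr>
<tr><td>Artist B</td><td>567</td></tr>
</table>
"""

rows = parse_table(SAMPLE_HTML)
```

Because the page is static, one HTTP request per page is enough, and the resulting list of dicts can be handed straight to a dataframe constructor.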

Once a page is scraped, the resulting dataframe is passed to an upload function, which writes it from memory to a Parquet file and uploads that file to Azure Blob Storage.
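The upload step described above could look roughly like the sketch below. It is an illustration under assumptions, not the repo's actual function: `dataframe_to_parquet_bytes`, `get_container_client` and `upload_dataframe` are hypothetical names, and the code assumes `pandas` with a Parquet engine such as `pyarrow`, plus the `azure-storage-blob` package, are installed. Only `BLOB_STORE_CONN_STR`, `CONTAINER_NAME` and `file_name` come from this README.

```python
import io

BLOB_STORE_CONN_STR = ""  # your storage account connection string
CONTAINER_NAME = ""       # the container that receives the files


def dataframe_to_parquet_bytes(df):
    """Serialise a dataframe to Parquet entirely in memory."""
    buffer = io.BytesIO()
    # pandas needs a Parquet engine (pyarrow or fastparquet) for this call.
    df.to_parquet(buffer, index=False)
    return buffer.getvalue()


def get_container_client():
    """Build a container client from the two variables above."""
    # Imported lazily so the rest of the sketch loads without the Azure SDK.
    from azure.storage.blob import BlobServiceClient

    service = BlobServiceClient.from_connection_string(BLOB_STORE_CONN_STR)
    return service.get_container_client(CONTAINER_NAME)


def upload_dataframe(df, file_name, container_client=None):
    """Upload df as a Parquet blob called file_name; return the byte count."""
    if container_client is None:
        container_client = get_container_client()
    data = dataframe_to_parquet_bytes(df)
    # overwrite=True replaces any existing blob with the same name.
    container_client.upload_blob(name=file_name, data=data, overwrite=True)
    return len(data)
```

A call such as `upload_dataframe(df, "example.parquet")` (the file name is a placeholder) would then write the blob without ever touching the local disk. Passing `container_client` explicitly is a design choice of this sketch that makes the function easy to exercise with a stand-in object.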

This repo is incredibly plug-n-play friendly. All you need to do is fill in the following variables in the upload function:

  • BLOB_STORE_CONN_STR = ''
  • CONTAINER_NAME = ''

Any further customisation, such as changing file names, can be achieved by editing the file_name variable in each function.
