Skip to content

Covid-19 awareness on twitter dataset and analysis code.

Notifications You must be signed in to change notification settings

mHassan11/Twitter-COVID19

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 

Repository files navigation

Characterizing COVID-19 Awareness on Twitter

Here is a link for dataset and the paper can be found here.

Dataset

The dataset is collected from Twitter using the scripts present in the "data_collection_scripts.zip" folder. You can read the detail about data collection pipeline in the section 2 of paper.

Country Data
USA Yes
Spain Yes
Italy Yes
France Yes
Germany Yes
UK Yes
Turkey Yes
Russia Yes
Brazil Yes
Belgium Yes
Canada Yes
Netherlands Yes
Switzerland Yes
Portugal Yes
India Yes
Peru Yes
Ireland Yes
Austria Yes
Sweden Yes
Israel Yes
Iran Unavailable
China No
  1. The data cleaning scripts are present with most of the countries' data in their respective folder
  2. The scripts are in basic Python and JavaScript, and the rudimentary knowledge of programming is sufficient to understand and modify them according to need.

Scripts

The scripts which are used for analysis, such as word-cloud, sentiment analysis, topic modelling(detail present in the section 3 of the paper) etc, can be found in code folder.

Note

  • The basic domain or programming knowledge is sufficient to understand the scripting for the project.
  • The scripts and data have been maintained by authors up until July 2020.
  • Academic contribution is expected and welcome. Feel free to reach out to authors in case of any query, concern and suggestion.

Authors

  1. Muhammad Saad at University of Central Florida (UCF) and can be approached with email.
  2. Muhammad Hassan at University of Illinois at Chicago (UIC) and can be approached at this email.

About

Covid-19 awareness on twitter dataset and analysis code.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published