Published June 13, 2024 | Version v1
Dataset Open

Labeled data and models for COVID-19 vaccine related tweets with stance, location, and topics

  • 1. Northern Illinois University
  • 2. ROR icon University of South Carolina

Description

The dataset contains Tweet IDs along with the location and tweet timestamp. The tweets are labeled based on motivating/demotivating status, stance towards the COVID-19 vaccine, and topic in the tweet text. To comply with Twitter guidelines, we removed the tweet texts and author information. You can use Hydrator API to hydrate the tweets.

The repository also contains the machine-learning models for topic modeling, de/motivation classifier, and stance detection from the tweets.

Files

anonymized_tweets_with_labels.csv

Files (6.6 GB)

Name Size Download all
md5:5d795c9be77408d69cc0bdf916624c82
48.7 MB Preview Download
md5:bed44686a54db0d5867e8dd35e0755ff
662.7 MB Download
md5:eb87f58fd38ac7d987f722cd6d6ee495
1.2 GB Download
md5:5e54038c1ee75a81a0a11ec636971b4c
3.7 GB Download
md5:ea5c32d80ef296f81c89b15c5862d042
981.5 MB Download

Additional details

Related works

Is derived from
Journal article: 10.1016/j.nlp.2024.100085 (DOI)

Software

Programming language
Python