A Look into COVID-19 Vaccination Debate on Twitter
Authors/Creators
- 1. Universidade Federal de Minas Gerais
- 2. IBM Research
Description
Our data collection was driven by the goal of gathering a corpus of English-language tweets that would be informative of the worldwide online debate on COVID-19 vaccines. To that end, we used the Twitter Search API to collect tweets matching specific keywords related to COVID-19 vaccination. We built a keyword list that includes terms related to both pro- and anti-vaccine discourse, as well as the names of the best-known COVID-19 vaccines available so far. Specifically, we consider the following keywords: vaccine, vaccination, anti-vaccination, antivax, anti-vaccine, anti-vax, anti-vaxxers, NoForcedVaccination, getvaccinated, pfizer, moderna, astrazeneca, covaxin, biontech, novavax, coronavac, sputnikv, bnt162b2. In total, we gathered over 12 million tweets, covering 9 weeks, from December 1st, 2020 to January 31st, 2021. This is an important period that includes the launch of the first worldwide COVID-19 vaccination campaign (on December 8th, 2020, in the United Kingdom), as well as several other real-world events that shaped people's discussions.
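As a rough illustration of how the keyword list above can be applied to tweet text (for example, after rehydrating the tweet IDs), the following sketch does a case-insensitive substring match. The function name `matched_keywords` is hypothetical, and the matching rule is an assumption: the collection itself relied on the Twitter Search API, whose matching semantics may differ from this local approximation.

```python
# Keyword list copied from the dataset description.
KEYWORDS = [
    "vaccine", "vaccination", "anti-vaccination", "antivax", "anti-vaccine",
    "anti-vax", "anti-vaxxers", "NoForcedVaccination", "getvaccinated",
    "pfizer", "moderna", "astrazeneca", "covaxin", "biontech", "novavax",
    "coronavac", "sputnikv", "bnt162b2",
]


def matched_keywords(text):
    """Return the keywords found in `text`, in list order.

    Assumption: plain case-insensitive substring matching. This is not
    the Twitter Search API's matching logic, only a local approximation.
    """
    lowered = text.lower()
    return [kw for kw in KEYWORDS if kw.lower() in lowered]


if __name__ == "__main__":
    print(matched_keywords("Got my Pfizer vaccine today"))
```

Because the match is on substrings, a hashtag such as `#GetVaccinated` matches `getvaccinated` without any special handling.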
This dataset is aggregated by week and keyword. Only the tweet IDs are made available, in accordance with Twitter's Privacy Policy.
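To make the weekly aggregation concrete, the sketch below assigns a calendar date to a week number within the collection period. It assumes consecutive 7-day buckets counted from December 1st, 2020; the actual week boundaries used in the released files are not stated here, so both the bucketing rule and the name `week_index` are illustrative assumptions.

```python
from datetime import date

# Collection period as stated in the description.
START = date(2020, 12, 1)
END = date(2021, 1, 31)


def week_index(day):
    """Return the zero-based week bucket for `day`.

    Assumption: weeks are consecutive 7-day windows starting at START.
    """
    if not START <= day <= END:
        raise ValueError(f"{day} is outside the collection period")
    return (day - START).days // 7


if __name__ == "__main__":
    # Number of buckets needed to cover the whole period.
    print(week_index(END) + 1)
```

Under this assumption the 62-day period spans nine buckets, which is consistent with the 9 weeks reported in the description.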
Please cite as: MALAGOLI, L. G. ; STANCIOLI, J. ; VASCONCELOS, M. ; FERREIRA, C. H. ; SILVA, A. P. C. ; ALMEIDA, Jussara. A Look into COVID-19 Vaccination Debate on Twitter. In: 13th ACM Conference on Web Science, 2021.
Files
| Name | Size | Checksum |
|---|---|---|
| tweet_ids_dataset.zip | 89.5 MB | md5:8e31c08e0f7e510b95c7344b7b0d163e |
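The MD5 checksum listed for the archive can be used to confirm that a download is complete and uncorrupted. A minimal sketch using Python's standard `hashlib` module follows; the helper names `md5_of_file` and `verify_download` and the local file path are assumptions, not part of the dataset.

```python
import hashlib

# Checksum published alongside tweet_ids_dataset.zip.
EXPECTED_MD5 = "8e31c08e0f7e510b95c7344b7b0d163e"


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True if the file at `path` matches the expected checksum."""
    return md5_of_file(path) == expected


if __name__ == "__main__":
    # Hypothetical local path to the downloaded archive.
    print(verify_download("tweet_ids_dataset.zip"))
```

Reading in fixed-size chunks keeps memory use constant regardless of the archive size.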