Published January 13, 2023 | Version V2 (3 FEB 2023)
Dataset Open

VaxxHesitancy: A Dataset for Studying Hesitancy Towards COVID-19 Vaccination on Twitter

  • 1. The University of Sheffield

Description

We create a publicly available dataset of over 3,100 COVID-19 vaccine-related tweets labeled as one of four stance categories: pro-vaxx, anti-vaxx, vaxx-hesitant, or irrelevant.

***

Please use the V2 version.

***

We split our dataset into two separate files:

(1) VaccineHesitancy_train_v2.csv (Single + Double annotated)

(2) VaccineHesitancy_test.csv (Double annotated)

We present the details of this dataset here:

VaxxHesitancy: A Dataset for Studying Hesitancy Towards COVID-19 Vaccination on Twitter (ICWSM 2023)

Our Pre-trained model (GateNLP/covid-vaccine-twitter-bert) : https://huggingface.co/GateNLP/covid-vaccine-twitter-bert

Paper: https://ojs.aaai.org/index.php/ICWSM/article/view/22213/21992

 

@inproceedings{mu2023vaxxhesitancy,
  title={VaxxHesitancy: A Dataset for Studying Hesitancy Towards COVID-19 Vaccination on Twitter},
  author={Mu, Yida and Jin, Mali and Grimshaw, Charlie and Scarton, Carolina and Bontcheva, Kalina and Song, Xingyi},
  booktitle={Proceedings of the International AAAI Conference on Web and Social Media},
  volume={17},
  pages={1052--1062},
  year={2023}
}

 

 

 

Files

VaccineHesitancy_test.csv

Files (210.1 kB)

Name Size Download all
md5:2cb4c9e102370e6521020a3b859131c8
33.5 kB Preview Download
md5:490799e3247fc571f8d588fd68909c4e
176.5 kB Preview Download

Additional details

Funding

XAIvsDisinfo: eXplainable AI Methods for Categorisation and Analysis of COVID-19 Vaccine Disinformation and Online Debates EP/W011212/1
UK Research and Innovation
SoBigData-PlusPlus – SoBigData++: European Integrated Infrastructure for Social Mining and Big Data Analytics 871042
European Commission