Dataset Restricted Access
Antigoni-Maria Founta; Constantinos Djouvas; Despoina Chatzakou; Ilias Leontiadis; Jeremy Blackburn; Gianluca Stringhini; Athena Vakali; Michael Sirivianos; Nicolas Kourtellis
Restricted Dataset for the "Large Scale Crowdsourcing and Characterization of Twitter Abusive Behavior" paper, published in ICWSM 2018. The full text of the paper can be found here. The Public version of the dataset can be found here
hatespeech_text_label_vote_RESTRICTED_100K.csv: contains ~100K raws with tweet text, the associated majority label, and the number of votes for the majority label. The tweets are shuffled so that there is no connection between tweet IDs and texts (in order to be in line with the T&C of Twitter).
Please cite the paper in any published work that uses any of these resources.
@inproceedings{founta2018large,
title={Large Scale Crowdsourcing and Characterization of Twitter Abusive Behavior},
author={Founta, Antigoni-Maria and Djouvas, Constantinos and Chatzakou, Despoina and Leontiadis, Ilias and Blackburn, Jeremy and Stringhini, Gianluca and Vakali, Athena and Sirivianos, Michael and Kourtellis, Nicolas},
booktitle={11th International Conference on Web and Social Media, ICWSM 2018},
year={2018},
organization={AAAI Press}
}
For any further questions contact a.m.founta at gmail dot com AND markos.charalambous at eecei dot cut dot ac dot cy
You may request access to the files in this upload, provided that you fulfil the conditions below. The decision whether to grant/deny access is solely under the responsibility of the record owner.
All versions | This version | |
---|---|---|
Views | 212 | 118 |
Downloads | 55 | 32 |
Data volume | 648.1 MB | 406.5 MB |
Unique views | 140 | 92 |
Unique downloads | 35 | 24 |