Dataset for "Mean Birds: Detecting Aggression and Bullying on Twitter"

Despoina Chatzakou; Nicolas Kourtellis; Jeremy Blackburn; Emiliano De Cristofaro; Gianluca Stringhini; Athena Vakali

doi:10.5281/zenodo.1184178

Published February 25, 2018 | Version v1

Dataset Open

1. Aristotle University of Thessaloniki
2. Telefonica Research
3. University College London

In recent years, bullying and aggression against social media users have grown significantly, causing serious consequences for victims of all demographics. Nowadays, cyberbullying affects more than half of young social media users worldwide, who suffer from prolonged and/or coordinated digital harassment. Tools and technologies geared to understand and mitigate it are also scarce and mostly ineffective. In this paper, we present a principled and scalable approach to detect bullying and aggressive behaviour on Twitter. We propose a robust methodology for extracting text, user, and network-based attributes, and study the properties of bullies and aggressors and the features that distinguish them from regular users. We find that bullies post less, participate in fewer online communities, and are less popular than normal users. Aggressors are relatively popular and tend to include more negativity in their posts. We evaluate our methodology using a corpus of 1.6M tweets posted over 3 months, and show that machine learning classification algorithms can accurately detect users exhibiting bullying and aggressive behaviour, with over 90% AUC.

Files

Files (89.2 kB)

Name	Size	MD5
websci_dataset.zip	89.2 kB	2530ebb7c3b90412eb91d3a5f1c0b590
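
Since the record lists an MD5 checksum for the archive, a downloaded copy can be checked against it before use. The following is a minimal sketch, not part of the dataset itself: the local path `websci_dataset.zip` and the helper names `md5_of_file` and `matches_listed_checksum` are assumptions made for illustration.

```python
import hashlib
from pathlib import Path

# MD5 value as listed on the dataset record for websci_dataset.zip.
EXPECTED_MD5 = "2530ebb7c3b90412eb91d3a5f1c0b590"


def md5_of_file(path, chunk_size=65536):
    """Return the hex MD5 digest of the file at `path`, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest equals the expected hex string."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Hypothetical download location; adjust to wherever the archive was saved.
    archive = Path("websci_dataset.zip")
    if archive.exists():
        ok = matches_listed_checksum(archive)
        print("checksum OK" if ok else "checksum MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

A mismatch usually points to an incomplete or corrupted download. MD5 is suitable here only as an integrity check, not as protection against deliberate tampering.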

Additional details

Funding

European Commission
ENCASE – EnhaNcing seCurity And privacy in the Social wEb: a user centered approach for the protection of minors (grant 691025)

Metric	All versions	This version
Views	2,946	2,942
Downloads	744	744
Data volume	68.2 MB	68.2 MB
