Planned intervention: On Thursday 19/09 between 05:30-06:30 (UTC), Zenodo will be unavailable because of a scheduled upgrade in our storage cluster.
Published February 25, 2018 | Version v1
Dataset Open

Dataset for "Mean Birds: Detecting Aggression and Bullying on Twitter"

  • 1. Aristotle University of Thessaloniki
  • 2. Telefonica Research
  • 3. University College London

Description

In recent years, bullying and aggression against social media users have grown significantly, causing serious consequences to victims of all demographics. Nowadays, cyberbullying affects more than half of young social media users worldwide, suering from prolonged and/or coordinated digital harassment. Also, tools and technologies geared to understand and mitigate it are scarce and mostly ineffective. In this paper, we present a principled and scalable approach to detect bullying and aggressive behaviour on Twitter. We propose a robust methodology for extracting text, user, and network-based attributes, studying the properties of bullies and aggressors, and what features distinguish them from regular users. We nd that bullies post less, participate in fewer online communities, and are less popular than normal users. Aggressors are relatively popular and tend to include more negativity in their posts. We evaluate our methodology using a corpus of 1.6M tweets posted over 3 months, and show that machine learning classication algorithms can accurately detect users exhibiting bullying and aggressive behaviour, with over 90% AUC.

Files

websci_dataset.zip

Files (89.2 kB)

Name Size Download all
md5:2530ebb7c3b90412eb91d3a5f1c0b590
89.2 kB Preview Download

Additional details

Funding

ENCASE – EnhaNcing seCurity And privacy in the Social wEb: a user centered approach for the protection of minors 691025
European Commission