HaterNet a system for detecting and analyzing hate speech in Twitter

Lara Quijano-Sanchez; Juan Carlos Pereira Kohatsu; Federico Liberatore; Miguel Camacho-Collados

doi:10.5281/zenodo.2592149

Published March 13, 2019 | Version 1.0

Dataset Open

HaterNet a system for detecting and analyzing hate speech in Twitter

1. Universidad Autonoma de Madrid
2. Copmlutense university of Madrid
3. State Secretariat for Security Interior Ministry, Madrid, Spain

This dataset consists of two corpuses used in the paper "Detecting and analyzing hate speech in Twitter: HaterNet a system in the Spanish prevention of hate crime office". A first one based on tweets collected at different random dates between February 2017 and December 2017 with a final size of 2 million tweets. A second one with 6,000 tweets labeled as described in the paper as hate containing or not.

Files

labeled_corpus_6K.txt

Files (137.2 MB)

Name	Size	Download all
labeled_corpus_6K.txt md5:22b104d1d1f2f67fcbc616dfbcf7025b	878.4 kB	Preview Download
raw_corpus_2M.zip md5:93b37cbdb7262e302b3c9e135fcd0c9c	136.3 MB	Preview Download

Views

10K

Downloads

Show more details

	All versions	This version
Views	5,761	5,727
Downloads	9,553	9,484
Data volume	108.7 GB	108.1 GB

More info on how stats are collected....

DOI

Resource type

Dataset

Publisher

Zenodo

Languages

Spanish

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: June 5, 2019
Modified: January 24, 2020

HaterNet a system for detecting and analyzing hate speech in Twitter

Authors/Creators

Description

Files

labeled_corpus_6K.txt

Files (137.2 MB)