Published January 11, 2018 | Version v1
Dataset Restricted

Posts from a brazilian anonymous imageboard

Creators

Description

This dataset contains text with toxic content and hate speech.

A set of discussion threads published in a brazilian anonymous imageboard, a 4chan-style discussion forum. This dataset includes 158,280 user posts in 4,539 threads published between 18 dec. 2016 and 19 jan. 2017. The data was collected through a web scraper developed for this purpose, which gattered textual content and published date from the posts. Images where not collected due to possible ilegal content.

The data was used in the master degree thesis "Análise das apropriações do anonimato nas subculturas dos imageboards".

Files

Restricted

The record is publicly accessible, but files are restricted to users with access.

Request access

If you would like to request access to these files, please fill out the form below.

You need to satisfy these conditions in order for this request to be accepted:

Access is granted only for academic purposes. Please describe your research project to get access.

You are currently not logged in. Do you have an account? Log in here

Additional details

Related works