Published January 8, 2024
| Version 1.0
Dataset
Restricted
Supplementary material to 'Automatic Identification of Hate Speech – A Case-Study of Alt-Right YouTube Videos'
Description
The associated files have been created for and is analysed in a fortcoming article entitled Automatic Identification of Hate Speech – A Case-Study of Alt-Right YouTube Videos'. The material is divided into six tables as follows:
Sentence top 5% | The 19th 20-quantile predicted most hateful sentences |
Sentence bottom 5% | The bottom 20-quantile predicted moste hatefull sentences (the least likely to contain hatespeech) |
Paragraphs | Prediction and annotation of paragraphs |
Video top 10% | Titles of the top decile predicted hateful videos |
Video bottom 10% | Titles of the bottom decile predicted hateful videos |
Video bottom 10% - Alt right | Titles of the bottom decile predicted hateful videos without History |
The data is uploaded in two formats:
Excel file: Automatic_Detection_of_Hate_Speech_a_Case-Study_of_Alt-Right_Videos.xlsx contains all six tables in one file, with a supplementary codebook.
Tab Separated Values (TSV): Each file correspond to a single sheet from the excel file, and are named accordingly. UTF-8 Encoded.