4881008
doi
10.5281/zenodo.4881008
oai:zenodo.org:4881008
Nguyen, Dong
Margetts, Helen
Rossini, Patricia
Tromble, Rebekah
CAD: the Contextual Abuse Dataset
Vidgen, Bertie
info:eu-repo/semantics/openAccess
Creative Commons Attribution 4.0 International
https://creativecommons.org/licenses/by/4.0/legalcode
online hate
machine learning
labelled dataset
online abuse
social media
<p><strong>Introducing CAD: the Contextual Abuse Dataset</strong> Bertie Vidgen, Dong Nguyen, Helen Margetts, Patricia Rossini, Rebekah Tromble, NAACL 2021</p>
<p>Online abuse can inflict harm on users and communities, making online spaces unsafe and toxic. Progress in automatically detecting and classifying abusive content is often held back by the lack of high quality and detailed datasets. We introduce a new dataset of primarily English Reddit entries which addresses several limitations of prior work. It (1) contains six conceptually distinct primary categories as well as secondary categories, (2) has labels annotated in the context of the conversation thread, (3) contains rationales and (4) uses an expert-driven group-adjudication process for high quality annotations. This repository contains the annotated dataset, annotation guidelines and the trained models and their output.</p>
<p><strong>Code</strong>: https://github.com/dongpng/cad_naacl2021</p>
<p><strong>Paper: </strong>https://www.aclweb.org/anthology/2021.naacl-main.182/</p>
<p> </p>
<p> </p>
<p> </p>
<p> </p>
<p> </p>
Zenodo
2021-05-31
info:eu-repo/semantics/other
4881007
v1.0 and v1.1
1622814519.367699
653223477
md5:ade101318f4e821870f39138e582d780
https://zenodo.org/records/4881008/files/experiments.zip
3553
md5:cbf10b21d62d2b18c5b06e1681b2ee9b
https://zenodo.org/records/4881008/files/README.txt
9941773
md5:63091d7a48bac5666e818fefed1f6575
https://zenodo.org/records/4881008/files/data.zip
public
10.5281/zenodo.4881007
isVersionOf
doi