Published April 20, 2023 | Version v1
Dataset Open

CoRAL: a Context-aware Croatian Abusive Language Dataset

  • 1. University of Essex
  • 2. Queen Mary University of London
  • 3. Queen Mary University of London / Jožef Stefan Institute

Description

CoRAL is a language- and culture-aware dataset of Croatian abusive language, covering the phenomena of implicitness and reliance on local and global context. See our paper: Ravi Shekhar, Mladen Karan, and Matthew Purver. CoRAL: a Context-aware Croatian Abusive Language Dataset. Findings of the ACL: AACL-IJCNLP, November 2022. https://aclanthology.org/2022.findings-aacl.21/

Notes

Funded by the project RobaCOFI (Robust and adaptable comment filtering), which indirectly received funding from the European Union's Horizon 2020 research and innovation action programme via the AI4Media Open Call #1 issued and executed under the AI4Media project (Grant Agreement no. 951911).

Files

CoRAL_ann1.csv

Files (350.1 kB)

MD5 checksum                          Size
md5:d067d4a62478f9546f90f2337a1b74d6  91.4 kB
md5:f32c0b083e330fef1f1d7854f13911d7  91.6 kB
md5:d7ab3a5b27c1027f82f689a37037163d  91.0 kB
md5:68696ddb7f6637db48d7d762b53f94d1  67.3 kB
md5:65d3616852dbf7b1a6d4b53b00626032  7.0 kB
md5:94c62e4e52719009ca3bb95a35598ab2  1.7 kB

Additional details

Funding

European Commission
AI4Media - A European Excellence Centre for Media, Society and Democracy (Grant Agreement no. 951911)

References

  • Ravi Shekhar, Mladen Karan and Matthew Purver (2022). CoRAL: a Context-aware Croatian Abusive Language Dataset. Findings of the ACL: AACL-IJCNLP.