Published January 15, 2021
| Version v1
Dataset
Open
SWSR: A Chinese Dataset and Lexicon for Sexist Hate Speech Detection
Authors/Creators
- 1. Queen Mary University of London
- 2. Oxford Brookes University
Description
Our repository presents the Sina Weibo Sexism Review (SWSR) dataset containing sexism-related posts in Chinese collected from Sina Weibo, as well as the Chinese lexicon SexHateLex. SWSR dataset consists of two files: hateWeibo.csv and hateComment.csv, and SexHateLex lexicon contains a list of 3016 abusive terms in the file SexHateLex.txt.
Files
hateComment.csv
Additional details
References
- A. Jiang, X. Yang, Y. Liu and A. Zubiaga (2021). SWSR: A Chinese Dataset and Lexicon for Sexist Hate Speech Detection. Under review.