Published November 5, 2021 | Version v1
Conference paper Open

Not all comments are equal: Insights into comment moderation from a topic-aware model

  • 1. University of Helsinki, Helsinki, Finland
  • 2. Queen Mary University of London

Description

Moderation of reader comments is a signifi- cant problem for online news platforms. Here, we experiment with models for automatic mod- eration, using a dataset of comments from a popular Croatian newspaper. Our analy- sis shows that while comments that violate the moderation rules mostly share common linguistic and thematic features, their con- tent varies across the different sections of the newspaper. We therefore make our models topic-aware, incorporating semantic features from a topic model into the classification de- cision. Our results show that topic informa- tion improves the performance of the model, increases its confidence in correct outputs, and helps us understand the model’s outputs.

Files

zosa-et-al21topic.pdf

Files (655.4 kB)

Name Size Download all
md5:79c4aa277b3fcc752dc2d33a9829e26d
655.4 kB Preview Download

Additional details

Funding

EMBEDDIA – Cross-Lingual Embeddings for Less-Represented Languages in European News Media 825153
European Commission