Knowledge Graph informed Fake News Classification via Heterogeneous Representation Ensembles

Koloski, Boshko; Stepišnik-Perdih, Timen; Robnik-Šikonja, Marko; Pollak, Senja; Škrlj, Blaž

doi:10.1016/j.neucom.2022.01.096

Published March 25, 2022 | Version v1

Journal article Open

Knowledge Graph informed Fake News Classification via Heterogeneous Representation Ensembles

1. Jožef Stefan Institute
2. University of Ljubljana, Ljubljana, Slovenia

Increasing amounts of freely available data both in textual and relational form offers exploration of richer document representations, potentially improving the model performance and robustness. An emerging problem in the modern era is fake news detection—many easily available pieces of information are not necessarily factually correct, and can lead to wrong conclusions or are used for manipulation. In this work we explore how different document representations, ranging from simple symbolic bag-of-words, to contextual, neural language model-based ones can be used for efficient fake news identification. One of the key contributions is a set of novel document representation learning methods based solely on knowledge graphs, i.e., extensive collections of (grounded) subject-predicate-object triplets. We demonstrate that knowledge graph-based representations already achieve competitive performance to conventionally accepted representation learners. Furthermore, when combined with existing, contextual representations, knowledge graph-based document representations can achieve state-of-the-art performance. To our knowledge this is the first larger-scale evaluation of how knowledge graph-based representations can be systematically incorporated into the process of fake news classification.

Files

Koloski.pdf

Files (2.7 MB)

Name	Size	Download all
Koloski.pdf md5:80b4cae4c073c27abf7133f95a20ed34	2.7 MB	Preview Download

Additional details

EMBEDDIA – Cross-Lingual Embeddings for Less-Represented Languages in European News Media 825153: European Commission

	All versions	This version
Views	52	52
Downloads	87	86
Data volume	236.0 MB	233.4 MB

Knowledge Graph informed Fake News Classification via Heterogeneous Representation Ensembles

Creators

Description

Files

Koloski.pdf

Files (2.7 MB)

Additional details

Funding