Multilingual Detection of Fake News Spreaders via Sparse Matrix Factorization

doi:10.5281/zenodo.4059635

Published September 30, 2020 | Version v1

Conference paper Open

Multilingual Detection of Fake News Spreaders via Sparse Matrix Factorization

1. Jožef Stefan Institute

Fake news is an emerging problem in online news and social media. Efficient detection of fake news spreaders and spurious accounts across multiple languages is becoming an interesting research problem, and is the key focus of this paper. Our proposed solution to PAN 2020 fake news spreaders challenge models the accounts responsible for spreading the fake news by accounting for different types of textual features, decomposed via sparse matrix factorization, to obtain easy-to-learn-from, compact representations, including the information from multiple languages. The key contribution of this work is the exploration of how powerful and scalable matrix factorization-based classification can be in a multilingual setting, where the learner is presented with the data from multiple languages simultaneously. Finally, we explore the joint latent space, where patterns from individual languages are maintained. The proposed approach scored second on the 2020 PAN shared task for identification of fake news spreaders.

Files

koloski_2020a.pdf

Files (733.3 kB)

Name	Size	Download all
koloski_2020a.pdf md5:e51fcd2cbe801a5ce26a5739cb728f03	733.3 kB	Preview Download

Additional details

EMBEDDIA – Cross-Lingual Embeddings for Less-Represented Languages in European News Media 825153: European Commission

	All versions	This version
Views	77	76
Downloads	48	48
Data volume	36.7 MB	36.7 MB

Multilingual Detection of Fake News Spreaders via Sparse Matrix Factorization

Creators

Description

Files

koloski_2020a.pdf

Files (733.3 kB)

Additional details

Funding