A Fake News Detection and Credibility Ranking Platform for Portuguese Online News
Authors/Creators
- 1. MOG Technologies, Department of Innovation, Maia, Portugal
- 2. ISEP Polytechnic Institute of Porto, Informatics Engineering Department, Portugal
- 3. PORTIC Porto Research, Technology Innovation Center, Polytechnic of Porto, Portuga
Description
The growth of social media has enabled the spread of tendentiously fake news content in a disorganized and fast manner. Despite the extensive research on fake news detection methods, algorithms and applications [1], most of the studies focused on Natural Language Processing (NLP) techniques for English content analysis. Assessing online news in Portuguese language and ranking them based on their credibility is still an open issue. Addressing this gap, the present work proposes a platform for European Portuguese online news credibility classification and ranking based on content and publication metadata. A dataset was built from the collection of online news from 36 sources, comprising content-related data (title, lead, body) and supporting metadata (source, category, year, author). A hierarchical architecture was envisioned and implemented for supervised binary classification probabilities extraction aggregated into multi-classes, resorting to three Machine Learning models: i) focused on textual analysis resorting to several NLP techniques; ii) delving into the news publication metadata; and iii) leveraging the balance between the two prior models’ probability predictions. This architecture minimizes data entropy, maximizing information retrieved, providing two credibility indexes and a fake news detector with 92% test accuracy and 95% deployed proof-of-concept accuracy. These models were integrated into a web service, enabling users to evaluate and rank online news resorting to the implemented architecture. This solution has several application areas, fostering society’s literacy and increased critical thinking, especially amongst younger generations.
Files
zenodo.pdf
Files
(113.8 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:cbf0613d8c790d10fe0d34945b3b3e4a
|
113.8 kB | Preview Download |