There is a newer version of the record available.

Published April 26, 2023 | Version v1.0.0
Software Open

FactNews: Sentence-Level Annotated Dataset To Predict Factuality and Media Bias

  • 1. University of São Paulo
  • 2. National University of Singapore
  • 3. Federal University of Minas Gerais

Description

Predicting the factuality of news reporting and the bias of media outlets is relevant to automated news credibility assessment and fact-checking. While prior work has focused on the veracity of news, we propose a fine-grained reliability analysis of entire media outlets. Specifically, we study the prediction of sentence-level factuality of news reporting and bias of media outlets, which may explain the overall reliability of a source more accurately. We first manually produced a large sentence-level dataset, titled "FactNews", composed of 6,191 sentences expertly annotated according to the factuality and media bias definitions from AllSides. We then present baseline models for sentence-level factuality prediction, obtained by fine-tuning BERT. Finally, given the severity of fake news and political polarization in Brazil, both the dataset and the baselines were developed for Portuguese. However, our approach may be applied to other languages.

Files

franciellevargas/FactNews-v1.0.0.zip

Files (103.8 kB)

md5:301865f90cd30bc09962be9d68c1f58f
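
The MD5 checksum listed for the archive can be used to confirm that a download is intact. The following is a minimal sketch using only the Python standard library; the helper names `md5_of_file` and `matches_record` and the local file path in the usage note are assumptions for illustration, not part of the record.

```python
import hashlib

# Checksum as listed in the Files section of this record.
EXPECTED_MD5 = "301865f90cd30bc09962be9d68c1f58f"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_record(path, expected=EXPECTED_MD5):
    """Hypothetical helper: True if the file's MD5 equals the expected value."""
    return md5_of_file(path) == expected.lower()
```

For example, `matches_record("FactNews-v1.0.0.zip")` would return `True` for an unmodified copy, assuming the archive was saved under that name in the current directory. MD5 is suitable here only as an integrity check against accidental corruption, not as protection against deliberate tampering.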

Additional details