FactNews: Sentence-Level Annotated Dataset To Predict Factually and Media Bias
- 1. University of São Paulo
- 2. National University of Singapore
- 3. Federal University of Minas Gerais
Description
Predicting the factuality of news reporting and the bias of media outlets is surely relevant for automated news credibility and fact-checking. While prior work has focused on the veracity of news, we propose a fine-grained reliability analysis of the entire media. Specifically, we study the prediction of sentence-level factuality of news reporting and bias of media outlets, which may explain more accurately the overall reliability of the entire source. We first manually produced a large sentence-level dataset, titled "FactNews", composed of 6,191 sentences expertly annotated according to factuality and media bias definitions from AllSides. As a result, baseline models for sentence-level factuality prediction were presented by fine-tuning BERT. Finally, due to the severity of fake news and political polarization in Brazil, both dataset and baseline were proposed for Portuguese. However, our approach may be applied to any other language.
Files
franciellevargas/FactNews-v1.0.0.zip
Files
(103.8 kB)
Name | Size | Download all |
---|---|---|
md5:301865f90cd30bc09962be9d68c1f58f
|
103.8 kB | Preview Download |
Additional details
Related works
- Is supplement to
- https://github.com/franciellevargas/FactNews/tree/v1.0.0 (URL)