There is a newer version of the record available.

Published January 6, 2019 | Version 1
Dataset Open

Dataset for fake news and articles detection

  • 1. American University of Beirut

Description

We have produced a labeled dataset that presents fake news surrounding the conflict in Syria. The dataset consists of a set of articles/news labeled by 0 (fake) or 1 (credible). Credibility of articles are computed with respect to a ground truth information obtained from the Syrian Violations Documentation Center  (VDC). In particular, for each article, we crowdsource the information extraction (e.g., date, location, Number of casualties) job using the crowdsourcing platform Figure Eight (formally CrowdFlower). Then, we match those articles against the VDC database to be able to deduce whether an article is fake or not. The dataset can be used to train machine learning models to detect fake news. 

 

 

 

Files

FINAL LABELED ARTICLES.csv

Files (1.5 MB)

Name Size Download all
md5:65987ab5b11f055fde28f0f098cabfe7
1.5 MB Preview Download