Published July 29, 2020 | Version v1
Dataset Open

FakeCovid- A Multilingual Cross domain Fact Check Dataset for COVID-19

  • 1. University of Duisburg-Essen, Germany
  • 2. University of Bamberg, Germany

Description

FakeCovid is the first multilingual cross-domain dataset of 7623 fact-checked news articles for COVID-19, collected from 04/01/2020 to 01/07/2020. We have collected the fact-checked articles from 92 fact-checking websites after obtaining references from Poynter and Snopes. We have manually annotated the collected articles into 11 categories of the fact-checked news according to their content. We ultimately generated dataset is in 40 languages from 105 countries. 

Files

FakeCovid_July2020.csv

Files (55.1 MB)

Name Size Download all
md5:c8f2774f9315f6311b2ab78e41ea9bea
55.1 MB Preview Download

Additional details

Related works

Is documented by
Conference paper: https://arxiv.org/pdf/2006.11343.pdf (URL)

Funding

RISE_SMA – RISE Social Media Analytics 823866
European Commission

References

  • Shahi, Gautam Kishore, and Durgesh Nandini. "FakeCovid--A Multilingual Cross-domain Fact Check News Dataset for COVID-19." arXiv preprint arXiv:2006.11343 (2020)