Dataset Open Access

Check Mate: Prioritizing User Generated Multi-Media Content for Fact-Checking

Tarunima Prabhakar; Kruttika Nadig; Anushree Gupta; Denny George

Citation Style Language JSON Export

  "publisher": "Zenodo", 
  "DOI": "10.5281/zenodo.4032629", 
  "language": "hin", 
  "title": "Check Mate: Prioritizing User Generated Multi-Media Content for Fact-Checking", 
  "issued": {
    "date-parts": [
  "abstract": "<p>Given volume of content and misinformation on social media, there is a need for systems that can support fact checkers by prioritizing content that needs to be fact checked. Prior research on prioritizing content for fact-checking has focused on news media articles, predominantly in English language. But there is an increasing amount of misinformation in user-generated content. Furthermore, misinformation is generated through information across modalities. In this paper we present a novel dataset that can be used to prioritize check-worthy posts from multi-media content in Hindi. It is unique in its 1) focus on user generated content, 2) multi-modality and 3) Hindi as the primary language of content. In addition, we also provide metadata for each post such as number of shares and likes of the post on ShareChat, a popular Indian social media platform, that allows for correlative analysis around virality and misinformation.&nbsp;</p>", 
  "author": [
      "family": "Tarunima Prabhakar"
      "family": "Kruttika Nadig"
      "family": "Anushree Gupta"
      "family": "Denny George"
  "note": "Additional data/information can be requested by emailing", 
  "type": "dataset", 
  "id": "4032629"
All versions This version
Views 213213
Downloads 3636
Data volume 94.3 GB94.3 GB
Unique views 192192
Unique downloads 3131


Cite as