Dataset Open Access

Check Mate: Prioritizing User Generated Multi-Media Content for Fact-Checking

Tarunima Prabhakar; Kruttika Nadig; Anushree Gupta; Denny George

Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="" xmlns:oai_dc="" xmlns:xsi="" xsi:schemaLocation="">
  <dc:creator>Tarunima Prabhakar</dc:creator>
  <dc:creator>Kruttika Nadig</dc:creator>
  <dc:creator>Anushree Gupta</dc:creator>
  <dc:creator>Denny George</dc:creator>
  <dc:description>Given volume of content and misinformation on social media, there is a need for systems that can support fact checkers by prioritizing content that needs to be fact checked. Prior research on prioritizing content for fact-checking has focused on news media articles, predominantly in English language. But there is an increasing amount of misinformation in user-generated content. Furthermore, misinformation is generated through information across modalities. In this paper we present a novel dataset that can be used to prioritize check-worthy posts from multi-media content in Hindi. It is unique in its 1) focus on user generated content, 2) multi-modality and 3) Hindi as the primary language of content. In addition, we also provide metadata for each post such as number of shares and likes of the post on ShareChat, a popular Indian social media platform, that allows for correlative analysis around virality and misinformation. </dc:description>
  <dc:description>Additional data/information can be requested by emailing</dc:description>
  <dc:subject>user generated content</dc:subject>
  <dc:subject>claim detection</dc:subject>
  <dc:title>Check Mate: Prioritizing User Generated Multi-Media Content for Fact-Checking</dc:title>
All versions This version
Views 214214
Downloads 3636
Data volume 94.3 GB94.3 GB
Unique views 193193
Unique downloads 3131


Cite as