Dataset Open Access
Tarunima Prabhakar; Kruttika Nadig; Anushree Gupta; Denny George
{ "inLanguage": { "alternateName": "hin", "@type": "Language", "name": "Hindi" }, "description": "<p>Given volume of content and misinformation on social media, there is a need for systems that can support fact checkers by prioritizing content that needs to be fact checked. Prior research on prioritizing content for fact-checking has focused on news media articles, predominantly in English language. But there is an increasing amount of misinformation in user-generated content. Furthermore, misinformation is generated through information across modalities. In this paper we present a novel dataset that can be used to prioritize check-worthy posts from multi-media content in Hindi. It is unique in its 1) focus on user generated content, 2) multi-modality and 3) Hindi as the primary language of content. In addition, we also provide metadata for each post such as number of shares and likes of the post on ShareChat, a popular Indian social media platform, that allows for correlative analysis around virality and misinformation. </p>", "license": "https://creativecommons.org/licenses/by/4.0/legalcode", "creator": [ { "@type": "Person", "name": "Tarunima Prabhakar" }, { "@type": "Person", "name": "Kruttika Nadig" }, { "@type": "Person", "name": "Anushree Gupta" }, { "@type": "Person", "name": "Denny George" } ], "url": "https://zenodo.org/record/4032629", "datePublished": "2020-09-15", "keywords": [ "fact-checking", "user generated content", "claim detection" ], "@context": "https://schema.org/", "distribution": [ { "contentUrl": "https://zenodo.org/api/files/ab6b26bc-9439-4a07-b072-bb4f233a7920/CheckMate_UGC_Hindi 2.zip", "encodingFormat": "zip", "@type": "DataDownload" } ], "identifier": "https://doi.org/10.5281/zenodo.4032629", "@id": "https://doi.org/10.5281/zenodo.4032629", "@type": "Dataset", "name": "Check Mate: Prioritizing User Generated Multi-Media Content for Fact-Checking" }
All versions | This version | |
---|---|---|
Views | 213 | 213 |
Downloads | 36 | 36 |
Data volume | 94.3 GB | 94.3 GB |
Unique views | 192 | 192 |
Unique downloads | 31 | 31 |