Published June 1, 2024 | Version v1
Dataset Open

Multilingual Fake News Detection Dataset: Gujarati, Hindi, Marathi, and Telugu

Description

This dataset is designed to support research in fake news detection across four major Indian languages: Gujarati, Hindi, Marathi, and Telugu. The dataset includes a diverse set of news articles collected from various sources, each labeled as either 'fake' or 'real'. The primary goal is to provide a resource that helps in the development and evaluation of natural language processing (NLP) models capable of detecting fake news in these regional languages.

Files

Gujarati_F&R_News.zip

Files (194.2 MB)

Name Size Download all
md5:77fec7e22af5e916816f3dc98a2781a7
42.8 MB Preview Download
md5:3be2e9e1f636472c99308a5eac1bb7e1
59.7 MB Preview Download
md5:5aa6a5904c6812ead2a09448ffa51b38
46.7 MB Preview Download
md5:de7c018e8c9346da73196c2f997fc6df
45.0 MB Preview Download