Multilingual Fake News Detection Dataset: Gujarati, Hindi, Marathi, and Telugu

Patil, Kailas; Parshv, Gandhi; Abhishek, Chauhan; Vaibhav, Patil; Ameya, Pawar

doi:10.5281/zenodo.11408513

Published June 1, 2024 | Version v1

Dataset Open

1. Vishwakarma University

This dataset supports research on fake news detection in four major Indian languages: Gujarati, Hindi, Marathi, and Telugu. It contains a diverse set of news articles collected from various sources, each labeled as either 'fake' or 'real'. The primary goal is to provide a resource for developing and evaluating natural language processing (NLP) models that detect fake news in these regional languages.

Files (194.2 MB)

Name	MD5	Size
Gujarati_F&R_News.zip	77fec7e22af5e916816f3dc98a2781a7	42.8 MB
Hindi_F&R_News.zip	3be2e9e1f636472c99308a5eac1bb7e1	59.7 MB
Marathi_F&R_News.zip	5aa6a5904c6812ead2a09448ffa51b38	46.7 MB
Telugu_F&R_News.zip	de7c018e8c9346da73196c2f997fc6df	45.0 MB
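
The MD5 digests in the file listing above can be used to check that each archive downloaded intact. The following is a minimal sketch, assuming the four archives were saved under their listed names in a single directory. The helper names md5_of_file and verify_downloads are illustrative and not part of the dataset. MD5 is used here only because it is the digest the record publishes; it detects accidental corruption, not deliberate tampering.

```python
import hashlib
from pathlib import Path

# MD5 digests copied from the file listing of this record.
EXPECTED_MD5 = {
    "Gujarati_F&R_News.zip": "77fec7e22af5e916816f3dc98a2781a7",
    "Hindi_F&R_News.zip": "3be2e9e1f636472c99308a5eac1bb7e1",
    "Marathi_F&R_News.zip": "5aa6a5904c6812ead2a09448ffa51b38",
    "Telugu_F&R_News.zip": "de7c018e8c9346da73196c2f997fc6df",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory="."):
    """Map each expected archive name to 'ok', 'mismatch', or 'missing'."""
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name
        if not path.is_file():
            results[name] = "missing"
        elif md5_of_file(path) == expected:
            results[name] = "ok"
        else:
            results[name] = "mismatch"
    return results


if __name__ == "__main__":
    # Assumes the archives sit in the current working directory.
    for name, status in verify_downloads().items():
        print(f"{status:9} {name}")
```

Any archive reported as mismatch should be downloaded again before use.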

	All versions	This version
Views	3,305	3,305
Downloads	2,779	2,779
Data volume	161.8 GB	161.8 GB

Resource type

Dataset

Publisher

Zenodo

Languages

Gujarati, Hindi, Telugu, Marathi

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.

Technical metadata

Created: June 1, 2024
Modified: June 1, 2024
