Published June 11, 2025 | Version v1
Dataset Open

Dataset for: "Using AI to detect misinformation and emotions on Telegram: a comparison with the media"

  • 1. ROR icon University of Castilla-La Mancha
  • 2. ROR icon Valencian International University
  • 3. ROR icon Universidad Internacional De La Rioja
  • 4. ROR icon Universidad de Granada

Description

This dataset contains the raw data used in the article “Using AI to detect misinformation and emotions on Telegram: a comparison with the media”, accepted for publication in index.comunicación. The data includes:
 • Telegram dataset (tg_messages.csv): 54,456 posts extracted from 33 public Telegram channels between 23 July and 16 November 2023, related to the political debate around the Amnesty Law in Spain. Each entry includes message metadata such as channel, date, views, and content.
 • News headlines dataset (Titulares.csv): 46,022 news headlines mentioning “amnesty”, extracted from 377 Spanish national media outlets indexed in MediaCloud, during the same period.
 • Analysis scripts: Available upon request or pending publication in the article’s supplementary materials.


The data was used for topic modelling, sentiment and emotion detection with NLP techniques based on Python libraries like BERTopic and pysentimiento. All data is anonymized and publicly accessible or derived from open sources.

Files

tg_messages.csv

Files (35.0 MB)

Name Size Download all
md5:e49135d7933b7e7f7918a0f7fbbae2f7
29.1 MB Preview Download
md5:837688f229d97273d19808a11700a022
5.9 MB Preview Download

Additional details

Dates

Updated
2023-11-30