Published March 3, 2026 | Version v1
Dataset Open

Audience Engagement and Narrative Resistance on Telegram: A Dataset of Sentiment Shifts, Semantic Similarity, and Zero-Shot Classification

  • 1. ROR icon Vilnius Gediminas Technical University

Description

I. Data Files (.csv)

  • Merged_Post_Comments_Data_anonymized.csv: The master dataset. It contains merged metrics for articles and comments, including views, reactions, and the probability scores for the six narrative classes (e.g., Military Success, Economic Hardship).

  • article_sentiment_scores_all_extended.csv: Contains sentiment scores and engagement metadata (total reactions, comment counts) for the source posts/articles across different Telegram portals.

  • comment_sentiment_scores_all_extended_ssot.csv: It includes individual sentiment scores, total reactions per comment, and semantic similarity scores relative to the parent article.

  • semantic_similarity_over_time_all.csv: A longitudinal dataset tracking how the semantic alignment of the audience’s conversation shifts in the hours following an article's publication.

II. Analysis Scripts (.py)

  • Zero-Shot_Narrative_Classification.py: The Python pipeline used to categorize Telegram posts into six distinct narrative frames using a valhalla/distilbart-mnli-12-1 (or similar) transformer model.

  • semantic_analysis_4tg_gpu.py: The core computational script used to generate sentiment scores and calculate cosine similarity between article embeddings and comment embeddings using GPU acceleration.

Files

article_sentiment_scores_all_extended_anonymized.csv

Files (528.7 MB)

Name Size Download all
md5:4b4555e5da1b5b1e834c6200a0f61a21
3.5 MB Preview Download
md5:b1c0a58eab2e9f7650225cafb291d780
274.4 MB Preview Download
md5:d5f7ec8fe38be33c37aee3583921776f
7.5 MB Preview Download
md5:0eb0cdeadaef69b7f5390962212a8b8a
32.9 kB Download
md5:f7cfeca685e889f49076c0ddacf3c4c8
243.3 MB Preview Download
md5:289dab496cee7bf890becd842cec7fad
4.3 kB Download