Published January 27, 2025 | Version 1.0.0
Dataset Open

Telegram channels graph dataset

  • 1. ROR icon National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute"

Description

The Telegram channels graph dataset is a collection of data that represents the structure and interactions within a set of 265 724 Telegram channels. It contains information regarding nodes (Telegram channels) and two types of edges (post forwards and URL mentions)

The archive consists of the following key files:

  1. Channels.csv: Each row entry represents a Telegram channel with attributes such as Id", "Name", "Title", and  "CreatedAt" which are obtained through the Telegram API. For nearly half of the entries (132 558 channels), additional data is provided, including the following columns: "P90Views," "P90Forwards," "TopLang," and "SnapshotAt." Fields "P90Views" and "P90Forwards" refer to the 90th percentile of views and forwards for all publications at the moment indicated by "SnapshotAt." "TopLang" represents the ISO language code corresponding to the language most commonly used in the channel's publications, as determined by the Fasttext model. 

  2. Forwards.csv: Each row entry represents the edge of Telegram channel publications forward from one channel to another. The "Count" field indicates the number of  forwards between the channels. There are 2207614 forward type edges.

  3. Mentions.csv: Each row entry represents mentions through URLs other Telegram channel. The "Count" field indicates the number of mentions. There are 8 046 017 Mention type edges.

Files

TelegramChannelsGraphDataset.zip

Files (65.6 MB)

Name Size Download all
md5:d0f1e673261bb58b95f156ef0524848e
65.6 MB Preview Download