Conference paper Open Access

The Pushshift Telegram Dataset

Baumgartner, Jason; Zannettou, Savvas; Squire, Megan; Blackburn, Jeremy

JSON-LD ( Export

  "description": "<p>The Pushshift Telegram Dataset</p>\n\n<p>The dataset consists of three files:</p>\n\n<p><em>Accounts.ndjson:&nbsp;</em>Provides data for 2.2M Telegram users that were active in the channels we crawled.</p>\n\n<p><em>Channels.ndjson:&nbsp;</em>Provides data for 28K Telegram channels that we crawled.</p>\n\n<p><em>Messages.ndjson: </em>Provides data for 317M Telegram messages that were posted by 2.2M Telegram users in 28K Telegram channels.</p>\n\n<p>Each file is a newline delimited json (ndjson) file that includes a json object with the data for each account/channel/message. The format of each object is according to the Telethon API (<a href=\"\"></a>), which is a Python interface for Telegram&#39;s API.</p>", 
  "license": "", 
  "creator": [
      "affiliation": "", 
      "@type": "Person", 
      "name": "Baumgartner, Jason"
      "affiliation": "Max Planck Institute", 
      "@type": "Person", 
      "name": "Zannettou, Savvas"
      "affiliation": "Elon University", 
      "@type": "Person", 
      "name": "Squire, Megan"
      "affiliation": "Binghamton University", 
      "@type": "Person", 
      "name": "Blackburn, Jeremy"
  "headline": "The Pushshift Telegram Dataset", 
  "image": "", 
  "datePublished": "2020-01-14", 
  "url": "", 
  "keywords": [
  "@context": "", 
  "identifier": "", 
  "@id": "", 
  "@type": "ScholarlyArticle", 
  "name": "The Pushshift Telegram Dataset"
All versions This version
Views 3,7013,701
Downloads 4,5804,580
Data volume 177.9 TB177.9 TB
Unique views 3,2583,258
Unique downloads 1,8891,889


Cite as