Conference paper Open Access
Baumgartner, Jason; Zannettou, Savvas; Squire, Megan; Blackburn, Jeremy
{ "description": "<p>The Pushshift Telegram Dataset</p>\n\n<p>The dataset consists of three files:</p>\n\n<p><em>Accounts.ndjson: </em>Provides data for 2.2M Telegram users that were active in the channels we crawled.</p>\n\n<p><em>Channels.ndjson: </em>Provides data for 28K Telegram channels that we crawled.</p>\n\n<p><em>Messages.ndjson: </em>Provides data for 317M Telegram messages that were posted by 2.2M Telegram users in 28K Telegram channels.</p>\n\n<p>Each file is a newline delimited json (ndjson) file that includes a json object with the data for each account/channel/message. The format of each object is according to the Telethon API (<a href=\"https://docs.telethon.dev/en/latest/\">https://docs.telethon.dev/en/latest/</a>), which is a Python interface for Telegram's API.</p>", "license": "https://creativecommons.org/licenses/by/4.0/legalcode", "creator": [ { "affiliation": "Pushshift.io", "@type": "Person", "name": "Baumgartner, Jason" }, { "affiliation": "Max Planck Institute", "@type": "Person", "name": "Zannettou, Savvas" }, { "affiliation": "Elon University", "@type": "Person", "name": "Squire, Megan" }, { "affiliation": "Binghamton University", "@type": "Person", "name": "Blackburn, Jeremy" } ], "headline": "The Pushshift Telegram Dataset", "image": "https://zenodo.org/static/img/logos/zenodo-gradient-round.svg", "datePublished": "2020-01-14", "url": "https://zenodo.org/record/3607497", "keywords": [ "Telegram", "pushshift" ], "@context": "https://schema.org/", "identifier": "https://doi.org/10.5281/zenodo.3607497", "@id": "https://doi.org/10.5281/zenodo.3607497", "@type": "ScholarlyArticle", "name": "The Pushshift Telegram Dataset" }
All versions | This version | |
---|---|---|
Views | 3,701 | 3,701 |
Downloads | 4,580 | 4,580 |
Data volume | 177.9 TB | 177.9 TB |
Unique views | 3,258 | 3,258 |
Unique downloads | 1,889 | 1,889 |