Published May 4, 2021 | Version v1
Dataset Open

Dataset of user and tweet ids of followers of @nytimes.

  • 1. Laboratoire de Physique Théorique et Modélisation, UMR-8089 CNRS, CY Cergy Paris Université

Description


## Data Format

There are three files containing the ids of different types of Twitter entities.
All files are [xz compressed archives](https://en.wikipedia.org/wiki/XZ_Utils) and
can be unpacked with, e.g., `unxz user_ids.dat.xz`. Each file contains one id per
line; the ids are sorted in ascending order to improve compressibility.

*  `user_ids.dat.xz`
*  `tweet_ids.dat.xz`
*  `retweet_ids.dat.xz`

The ids in `user_ids.dat.xz` are Twitter user ids.
The ids in `tweet_ids.dat.xz` and `retweet_ids.dat.xz` are Twitter tweet ids;
for convenience, we split them into regular tweets and retweets.
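
The one-id-per-line layout can be read without unpacking the archive first. The following is a minimal sketch, assuming the ids are plain decimal integers; the helper name `iter_ids` is hypothetical and not part of the dataset.

```python
import lzma
from typing import Iterator


def iter_ids(path: str) -> Iterator[int]:
    """Yield one integer id per non-empty line of an xz-compressed file.

    Assumption: each line holds a single decimal integer id.
    """
    # lzma.open in text mode decompresses on the fly, so the file
    # never has to be unpacked to disk.
    with lzma.open(path, mode="rt", encoding="ascii") as handle:
        for line in handle:
            line = line.strip()
            if line:
                yield int(line)
```

Streaming with a generator keeps memory use low, which matters for the larger files; for example, `sum(1 for _ in iter_ids("user_ids.dat.xz"))` would count the ids without holding them all in memory.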

In addition, there are two files that map tweet ids to the ids of the NYT articles
the tweets link to.

*  `tweet_nyt_ids.dat.xz`
*  `tweet_nyturl_ids.dat.xz`

The file `tweet_nyt_ids.dat.xz` contains, in the first column, the ids of tweets posted by the @nytimes account and, in the second column,
the id of the linked article (or `None` if the tweet did not link to an article).
The file `tweet_nyturl_ids.dat.xz` contains, in the first column, the ids of tweets from @nytimes followers that
link to an NYT article and, in the second column, the id of that article.
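
The two-column mapping files could be loaded along the following lines. This is a sketch under stated assumptions: the columns are taken to be separated by whitespace, and the article id is kept as a string because its format is not described here. The function name `load_article_map` is hypothetical.

```python
import lzma
from typing import Dict, Optional


def load_article_map(path: str) -> Dict[int, Optional[str]]:
    """Map tweet id -> article id (or None) from a two-column xz file.

    Assumptions: columns are whitespace-separated, and a literal
    "None" in the second column marks a tweet without a linked article.
    """
    mapping: Dict[int, Optional[str]] = {}
    with lzma.open(path, mode="rt", encoding="utf-8") as handle:
        for line in handle:
            fields = line.split()
            if len(fields) != 2:
                continue  # skip blank or malformed lines
            tweet_id, article_id = fields
            mapping[int(tweet_id)] = None if article_id == "None" else article_id
    return mapping
```

If the files turn out to use a different delimiter (a tab or comma, say), only the `line.split()` call would need to change.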

Files

readme.md

Files (3.8 GB)

| Checksum | Size |
| --- | --- |
| md5:589f8e65624b2101203f3c46a7ea31be | 1.2 kB |
| md5:895dc57e4a3227118b2dba6cf75ff82c | 1.8 GB |
| md5:a5e76639d1ed75883f9d5bbbe7b105a1 | 1.9 GB |
| md5:addd63933bf7257f026272c7cf5ae3d2 | 738.4 kB |
| md5:59fae35d6f5014526d5505028279570c | 6.5 MB |
| md5:5683e69a544674a16ff8d2bc3c2ac9f4 | 26.8 MB |
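
A downloaded file can be checked against the md5 checksums listed above. The sketch below computes the digest in chunks so that the multi-gigabyte archives do not have to fit in memory. The helper names `md5_of_file` and `matches_checksum` are hypothetical, and since the listing here does not show which checksum belongs to which file name, the expected value is passed in by the caller.

```python
import hashlib


def md5_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex md5 digest of a file, read in chunks of chunk_size bytes."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops once read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path: str, expected: str) -> bool:
    """Compare a file's md5 with an expected value such as 'md5:589f...'."""
    # Accept the checksum with or without the 'md5:' prefix.
    expected_hex = expected.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected_hex
```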