Published February 21, 2014 | Version v2
Dataset Open

Topical diversity of user interests and content

  • 1. Center for Complex Networks and Systems Research, School of Informatics and Computing, Indiana University Bloomington


  • Source: Sampled public tweets from Twitter streaming API.
  • Date range: January 1, 2013 to March 31, 2013.
  • Data size: 6.4 GB; about 490 millions tweets.
  • Contains:
    1. Sampled tweets during 3 months.
    2. Each tweet is associated with a timestamp, anonymized user ID, and a list of hashtags.
  • Please cite:
    • Weng, L. and Menczer, F., 2015. Topicality and impact in social media: diverse messages, focused messengers. PloS one, 10(2), p.e0118410. DOI



Files (6.8 GB)

Name Size Download all
15.0 kB Preview Download
6.8 GB Download
1.1 kB Preview Download