Published February 21, 2014 | Version v2
Dataset Open

Topical diversity of user interests and content

  • 1. Center for Complex Networks and Systems Research, School of Informatics and Computing, Indiana University Bloomington

Description

  • Source: Sampled public tweets from Twitter streaming API.
  • Date range: January 1, 2013 to March 31, 2013.
  • Data size: 6.4 GB; about 490 millions tweets.
  • Contains:
    1. Sampled tweets during 3 months.
    2. Each tweet is associated with a timestamp, anonymized user ID, and a list of hashtags.
  • Please cite:
    • Weng, L. and Menczer, F., 2015. Topicality and impact in social media: diverse messages, focused messengers. PloS one, 10(2), p.e0118410. DOI

Files

LICENSE.CC-BY-NC-ND-4.0.txt

Files (6.8 GB)

Name Size Download all
md5:9536ae5431be9e61b7e46c13d8074aa4
15.0 kB Preview Download
md5:1fdc922693a10db9fd838de00bf1cb69
6.8 GB Download
md5:afcf3e81392637eeb6a353756f9c52ce
1.1 kB Preview Download