Dataset Open Access

Astroturf/Legitimate Classification

Ratkiewicz, Jacob; Conover, Michael; Meiss, Mark; Goncalves, Bruno; Flammini, Alessandro; Menczer, Filippo

This is the training data used to produce the results shown in the paper listed below.

  • Source: Sampled public tweets from Twitter streaming API.
  • Date range: September 14 to October 27, 2010.
  • Contains:
    1. data.arff: holds the un-resampled training data.
    2. data_balanced.arff: holds the resampled training data.
    3. data.instance_to_id.pickle: holds a Python pickle relating instance IDs in the data.arff file with Meme IDs in the Truthy database. To view the page for a particular meme ID, go to http://truthy.indiana.edu/m?id=
  • Please cite:
Files (148.2 kB)
Name Size
LICENSE.CC-BY-NC-ND-4.0.txt
md5:9536ae5431be9e61b7e46c13d8074aa4
15.0 kB Download
training_data.tar
md5:9fa04ebc9e24f2906317a861a87a222e
133.1 kB Download
74
10
views
downloads
All versions This version
Views 7474
Downloads 1010
Data volume 976.9 kB976.9 kB
Unique views 6464
Unique downloads 88

Share

Cite as