Published July 5, 2017 | Version v1
Dataset Open

Early detection of promoted campaigns on social media

  • 1. School of Informatics and Computing, Indiana University, Bloomington, IN, USA

Description

Please cite our paper as follows, when you are using our dataset:

@article{varol2017early,
  Author = {Onur Varol and Emilio Ferrara and Filippo Menczer and Alessandro Flammini},
  Title = {Early Detection of Promoted Campaigns on Social Media},
  Journal = {EPJ Data Science},
  Url = {}
  Note = {In press},
  Year = {2017},
}

**** FORMAT ****

Information about promoted and organics trends used in this project available in the index file: 
`trend_information.dat`. We also provide `tweet-id` for each tweets used in our experiments and `timeseries` of features for each time window.

trend_information.dat:
  Format: {trend_name} \t {label} \t {trending_tstamp}
  * {trend-name} is the identifier of different trends. Information about each trend can be found in the index file.  
  * label: 1 (organic), 0 (promoted)
  * trending_tstamp provides a UNIX timestamp of the time particular hashtag is trended.

tweet-ids/tweet-ids_{trend-name}.dat.gz:
  Format: Each line contains a single tweet-id
  * {trend-name} is the identifier of different trends. Information about each trend can be found in the index file.

timeseries/timeseries
  Format: {feature-name} \t {timeseries}
  * {feature-name} represent different features extracted for a given trend
  * {timeseries} contain 120 comma seperated numerical value. Trending point locates at index 60.

Files

LICENSE.CC-BY-NC-ND-4.0.txt

Files (100.2 MB)

Name Size Download all
md5:9536ae5431be9e61b7e46c13d8074aa4
15.0 kB Preview Download
md5:32864c7921a525fedb9ac818590de004
100.2 MB Download