Published February 13, 2019 | Version v1
Dataset Open

Newly Emerged Rumors in Twitter

  • 1. Federal University of Rio de Janeiro

Description

*** Newly Emerged Rumors in Twitter ***

These 12 datasets are the result of an empirical study of how newly emerged rumors spread on Twitter. Newly emerged rumors are rumors whose rise and fall occur within a short period of time, in contrast to long-standing rumors. In particular, we focus on newly emerged rumors that gave rise to a simultaneous anti-rumor spreading against them. The story of each rumor is as follows:

1- Dataset_R1 : The National Football League team in Washington D.C. changed its name to Redhawks.

2- Dataset_R2 : A Muslim waitress refused to seat a church group at a restaurant, claiming "religious freedom" allowed her to do so.

3- Dataset_R3 : Facebook CEO Mark Zuckerberg bought a "super-yacht" for $150 million.

4- Dataset_R4 : Actor Denzel Washington said electing President Trump saved the U.S. from becoming an "Orwellian police state."

5- Dataset_R5 : Joy Behar of "The View" sent a crass tweet about a fatal fire in Trump Tower.

6- Dataset_R6 : Harley-Davidson's chief executive officer Matthew Levatich called President Trump "a moron."

7- Dataset_R7 : The animated children's program 'VeggieTales' introduced a cannabis character in August 2018.

8- Dataset_R8 : Michael Jordan resigned from the board at Nike and took his Air Jordan line of apparel with him.

9- Dataset_R9 : In September 2018, the University of Alabama football program ended its uniform contract with Nike, in response to Nike's endorsement deal with Colin Kaepernick.

10- Dataset_R10 : During confirmation hearings for Supreme Court nominee Brett Kavanaugh, congressional Democrats demanded that the nominee undergo DNA testing to prove he is not Adolf Hitler.

11- Dataset_R11 : Singer Michael Bublé's upcoming album will be his last, as he is retiring from making music.

12- Dataset_R12 : A screenshot from MyLife.com confirms that mail bomb suspect Cesar Sayoc was registered as a Democrat.

 

The structure of the Excel file for each dataset is as follows:

-   Each row corresponds to one captured tweet/retweet related to the rumor, and each column gives a specific piece of information about that tweet/retweet. From left to right, the columns are:

- User ID (user who has posted the current tweet/retweet)

- The description sentence in the profile of the user who published the tweet/retweet

- The number of tweets/retweets published by the user at the time of posting the current tweet/retweet

- Date and time of creation of the account from which the current tweet/retweet was posted

- Language of the tweet/retweet

- Number of followers 

- Number of followings (friends)

- Date and time of posting the current tweet/retweet

- Number of likes (favorites) the current tweet had received before it was crawled

- Number of times the current tweet had been retweeted before it was crawled

- Whether another tweet is embedded in the current tweet/retweet (for example, when the current tweet is a quote, reply, or retweet)

- The source (OS) of the device from which the current tweet/retweet was posted

- Tweet/Retweet ID

- Retweet ID (if the post is a retweet, this field gives the ID of the tweet that is retweeted by the current post)

- Quote ID (if the post is a quote, this field gives the ID of the tweet that is quoted by the current post)

- Reply ID (if the post is a reply, this field gives the ID of the tweet that the current post replies to)

- Frequency of tweet occurrences, i.e. the number of times the current tweet is repeated in the dataset (for example, the number of times a tweet appears in the dataset as a retweet posted by others)

- State of the tweet, which is one of the following (determined by agreement between the annotators):

          r : The tweet/retweet is a rumor post

          a : The tweet/retweet is an anti-rumor post

          q : The tweet/retweet is a question about the rumor that neither confirms nor denies it

          n : The tweet/retweet is not related to the rumor (even though it contains the query terms related to the rumor, it does not refer to the rumor)
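
The column layout and state codes described above can be sketched in Python as follows. This is only an illustration under stated assumptions: the names in COLUMNS are hypothetical labels chosen here for the 18 columns in their left-to-right order (the description does not say whether the Excel files carry a header row or what it contains), and label_rows and count_states are hypothetical helpers, not part of the dataset.

```python
from collections import Counter

# Hypothetical names for the 18 columns, in the left-to-right order
# given in the description. The real files may use different headers
# or none at all.
COLUMNS = [
    "user_id", "user_description", "user_status_count", "account_created_at",
    "language", "followers_count", "friends_count", "posted_at",
    "favorite_count", "retweet_count", "contains_other_tweet", "source",
    "tweet_id", "retweet_id", "quote_id", "reply_id", "frequency", "state",
]

# State codes as defined in the description.
STATE_LABELS = {"r": "rumor", "a": "anti-rumor", "q": "question", "n": "unrelated"}


def label_rows(rows):
    """Turn positional rows into dicts keyed by the hypothetical column names."""
    labeled = []
    for row in rows:
        if len(row) != len(COLUMNS):
            raise ValueError(f"expected {len(COLUMNS)} columns, got {len(row)}")
        labeled.append(dict(zip(COLUMNS, row)))
    return labeled


def count_states(labeled_rows):
    """Count rows per state, mapping each code to a readable label."""
    counts = Counter(str(r["state"]).strip().lower() for r in labeled_rows)
    return {STATE_LABELS.get(code, code): n for code, n in counts.items()}


if __name__ == "__main__":
    # Synthetic rows: 17 placeholder values followed by a state code.
    sample = [[None] * 17 + [s] for s in ["r", "a", "r", "q"]]
    print(count_states(label_rows(sample)))
```

With real data, the rows could come from any Excel reader, for example `pandas.read_excel(path, header=None).values.tolist()`; whether a header row has to be skipped first is an assumption to check against the actual files.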

 

 

 

 

Files

Files (10.6 MB)

Checksum Size
md5:720c70df1d2eeb2bc15f1d792a3c9739 1.5 MB
md5:6a137ed815c5a1bf0303f61eff762f03 229.3 kB
md5:0bc0d19750eefe96696fe1997364a1ba 199.8 kB
md5:13f42aa3df0a4473b59fab803dc6af44 5.9 MB
md5:65fab6a706e2d7f4040701306d0d35d7 102.8 kB
md5:88ab4fb9b8da0106fdb63522d01c698e 185.4 kB
md5:dc443e0a4c2d37758d3bd6ab93129dd8 195.5 kB
md5:6aa916020224eb24c3e1739c18c52145 125.7 kB
md5:bf106b79ffbdf258d35bf12d91d4d66a 146.9 kB
md5:f2cfdf7a1d642ff81bb1bff5b1d1ef2d 143.2 kB
md5:075d859d2491567d18efc789a3a0b116 1.6 MB
md5:4a65f51626eb40f81a873e2ede220a45 242.6 kB
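
The md5 values listed above can be used to check that a downloaded file arrived intact. The following is a minimal sketch using Python's standard hashlib module; md5_of_file and matches_listed_checksum are hypothetical helper names, and the file names are not given in this listing, so which checksum belongs to which dataset file has to be taken from the record page itself.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, listed):
    """Compare a file against a listed value such as 'md5:720c70df...'."""
    expected = listed.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

A call such as `matches_listed_checksum(local_path, "md5:720c70df1d2eeb2bc15f1d792a3c9739")` returns True only when the local file's digest equals the listed one; local_path here stands for wherever the file was saved.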