Dataset Open Access

TweetsKB (Part 2, Mar 2014 - Nov 2014)

Fafalios, Pavlos; Iosifidis, Vasileios

Part 2 of TweetsKB (March 2014 - November 2014)

TweetsKB is a public RDF corpus of anonymized data for a large collection of annotated tweets. The dataset currently contains data for more than 1.5 billion tweets, spanning more than 5 years (February 2013 - March 2018). Metadata information about the tweets as well as extracted entities, sentiments, hashtags and user mentions are exposed in RDF using established RDF/S vocabularies*. Example queries and more information are available through TweetsKB's home page: http://l3s.de/tweetsKB/.

If you use the dataset (or parts of it), please cite the following paper:

Fafalios, P., Iosifidis, V., Ntoutsi, E., & Dietze, S. (2018, June). TweetsKB: A Public and Large-Scale RDF Corpus of Annotated Tweets. In European Semantic Web Conference(pp. 177-190). Springer, Cham. (pdf - bib)

* For the sake of privacy, we anonymize the tweet IDs and usernames, and we do not provide the text of the tweets.

Files (45.6 GB)
Name Size
month_2014-03.n3.gz
md5:9e664b0d537c6fe4897b47126239fb4a
5.4 GB Download
month_2014-04.n3.gz
md5:8c3f84fda2e579f358558643f9fe383a
5.5 GB Download
month_2014-05.n3.gz
md5:74240d30b390fe30a56d0d9306ca1908
4.7 GB Download
month_2014-06.n3.gz
md5:b9651a3a2a2c2dfa4dce4d18da715944
2.5 GB Download
month_2014-07.n3.gz
md5:7cf0b3266428df1dadec966011f34c1e
5.9 GB Download
month_2014-08.n3.gz
md5:e3be8e987ff9e7aceb02beeaab3f0dbe
5.8 GB Download
month_2014-09.n3.gz
md5:644f2d66837fdd203ccd79f76447c2c7
5.2 GB Download
month_2014-10.n3.gz
md5:c1b3764c304953d50358385098df46a3
5.3 GB Download
month_2014-11.n3.gz
md5:f8080ecbd48d3dd02e1a03015762857d
5.3 GB Download
104
76
views
downloads
All versions This version
Views 10439
Downloads 7633
Data volume 367.4 GB166.5 GB
Unique views 8228
Unique downloads 158

Share

Cite as