Dataset Open Access

TweetsKB (Part 4, Jan 2016 - Nov 2016)

Fafalios, Pavlos; Iosifidis, Vasileios

Part 4 of TweetsKB (January 2016 - November 2016)

TweetsKB is a public RDF corpus of anonymized data for a large collection of annotated tweets. The dataset currently contains data for more than 1.5 billion tweets, spanning more than 5 years (February 2013 - March 2018). Tweet metadata, together with extracted entities, sentiments, hashtags and user mentions, is exposed in RDF using established RDF/S vocabularies*. Example queries and more information are available through TweetsKB's home page: http://l3s.de/tweetsKB/.
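Because each monthly dump is several gigabytes compressed, it can help to look at the first few lines before loading a file into an RDF toolkit. The following is a minimal sketch using only the Python standard library; the function name `peek_lines` is hypothetical, and UTF-8 encoding is an assumption that this page does not confirm.

```python
import gzip
from itertools import islice


def peek_lines(path, count=20):
    """Return the first `count` text lines of a gzip-compressed file.

    The file is streamed, so only the beginning is decompressed.
    UTF-8 is assumed here; undecodable bytes are replaced, not raised.
    """
    with gzip.open(path, mode="rt", encoding="utf-8", errors="replace") as handle:
        return [line.rstrip("\n") for line in islice(handle, count)]


# Example call (assumes the file has been downloaded to the working directory):
# for line in peek_lines("month_2016-01.n3.gz", count=10):
#     print(line)
```

This only shows raw text such as prefix declarations and the first statements. It does not parse the RDF, so a proper parser is still needed for any real processing.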

If you use the dataset (or parts of it), please cite the following paper:

Fafalios, P., Iosifidis, V., Ntoutsi, E., & Dietze, S. (2018, June). TweetsKB: A Public and Large-Scale RDF Corpus of Annotated Tweets. In European Semantic Web Conference (pp. 177-190). Springer, Cham.

* For the sake of privacy, we anonymize the tweet IDs and usernames, and we do not provide the text of the tweets.

Files (49.6 GB)

Name                 Size    Checksum
month_2016-01.n3.gz  4.8 GB  md5:ac8c68be5dc0f0fc16c195a293875dbe
month_2016-02.n3.gz  4.5 GB  md5:b097075f7ba690334feb3d4f6303cb1f
month_2016-03.n3.gz  4.7 GB  md5:02aafe2f02fc8580b52ea0aef3c31a1e
month_2016-04.n3.gz  4.6 GB  md5:e9ef1ed405e4fb77ace48ea5c848c81e
month_2016-05.n3.gz  4.6 GB  md5:346f0e158dbc6d18ff6c5090bcb75002
month_2016-06.n3.gz  4.6 GB  md5:c4fe8c5099600577e8254938d22f8124
month_2016-07.n3.gz  4.8 GB  md5:8016d40d78037dfedd0d8d569491b38a
month_2016-08.n3.gz  4.3 GB  md5:8245e5f1fc5de426526b913cad5368ea
month_2016-09.n3.gz  4.3 GB  md5:5ed32dc4adb463d233d3e115fcf2d7bb
month_2016-10.n3.gz  4.3 GB  md5:254e81914359c3b6d5febd5b130cd572
month_2016-11.n3.gz  4.2 GB  md5:203480a1fbc8a95391f9e7b79fedaf41
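
The md5 values listed with each file can be used to check that a download completed without corruption. Below is a minimal sketch in Python, assuming the file has already been downloaded locally; the helper names `md5_of_file` and `matches_listing` are hypothetical and not part of any TweetsKB tooling.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks
    so that multi-gigabyte files never have to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed_checksum):
    """Compare a file against a checksum written as 'md5:<hex>' or '<hex>'."""
    expected = listed_checksum.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected


# Example call (assumes the file sits in the working directory):
# ok = matches_listing("month_2016-01.n3.gz",
#                      "md5:ac8c68be5dc0f0fc16c195a293875dbe")
# print("checksum OK" if ok else "checksum mismatch, download again")
```

MD5 is adequate here for detecting transfer errors; it is not meant as a security guarantee.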