Dataset Open Access

TweetsKB (Part 6, Nov 2017 - March 2018)

Fafalios, Pavlos; Iosifidis, Vasileios

Part 6 of TweetsKB (November 2017 - March 2018)

TweetsKB is a public RDF corpus of anonymized data for a large collection of annotated tweets. The dataset currently contains data for more than 1.5 billion tweets, spanning more than 5 years (February 2013 - March 2018). Tweet metadata, together with extracted entities, sentiments, hashtags, and user mentions, is exposed in RDF using established RDF/S vocabularies*. Example queries and more information are available through TweetsKB's home page: http://l3s.de/tweetsKB/.

If you use the dataset (or parts of it), please cite the following paper:

Fafalios, P., Iosifidis, V., Ntoutsi, E., & Dietze, S. (2018, June). TweetsKB: A Public and Large-Scale RDF Corpus of Annotated Tweets. In European Semantic Web Conference (pp. 177-190). Springer, Cham.

* For the sake of privacy, we anonymize the tweet IDs and usernames, and we do not provide the text of the tweets.
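
Because each monthly file listed below is a multi-gigabyte gzip-compressed N3 dump, it can help to look at the first few lines before loading anything into an RDF store. The following is a minimal sketch using only the Python standard library; the helper name peek_lines and the assumption that the files are UTF-8 encoded are illustrative and not stated on this page.

```python
import gzip
from itertools import islice


def peek_lines(path, limit=10):
    """Return the first `limit` lines of a gzip-compressed text file.

    The file is decompressed as a stream, so only the beginning is read.
    UTF-8 is an assumption; undecodable bytes are replaced, not fatal.
    """
    with gzip.open(path, "rt", encoding="utf-8", errors="replace") as handle:
        return [line.rstrip("\n") for line in islice(handle, limit)]


if __name__ == "__main__":
    # File name taken from the listing on this page; it must be downloaded first.
    for line in peek_lines("month_2017-11.n3.gz", limit=20):
        print(line)
```

Streaming the file this way avoids decompressing the whole dump to disk just to inspect its prefixes and first statements.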

Files (17.1 GB)

Name                  Size    MD5
month_2017-11.n3.gz   3.5 GB  00cf45933198b8deee4302f73ba912a2
month_2017-12.n3.gz   3.4 GB  854405ff2c4086c0f8191f351640207a
month_2018-01.n3.gz   3.5 GB  6e98385de15ab52cf5a3503c1b6a27f2
month_2018-02.n3.gz   3.2 GB  a34abfc4fd96effdd4014288904e881f
month_2018-03.n3.gz   3.4 GB  00261309b10660ec3e3a32e31190e66e
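
The MD5 checksums in the file listing can be used to confirm that a download completed without corruption. The following is a minimal sketch, assuming the files were saved under their original names; the helper names md5_of_file and verify are illustrative and not part of any TweetsKB tooling.

```python
import hashlib
import os

# Checksums copied from the file listing on this page.
EXPECTED_MD5 = {
    "month_2017-11.n3.gz": "00cf45933198b8deee4302f73ba912a2",
    "month_2017-12.n3.gz": "854405ff2c4086c0f8191f351640207a",
    "month_2018-01.n3.gz": "6e98385de15ab52cf5a3503c1b6a27f2",
    "month_2018-02.n3.gz": "a34abfc4fd96effdd4014288904e881f",
    "month_2018-03.n3.gz": "00261309b10660ec3e3a32e31190e66e",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path):
    """Return True if the file's MD5 matches the listed checksum for its name."""
    expected = EXPECTED_MD5[os.path.basename(path)]
    return md5_of_file(path) == expected


if __name__ == "__main__":
    for name in EXPECTED_MD5:
        if os.path.exists(name):
            print(name, "OK" if verify(name) else "MISMATCH")
        else:
            print(name, "not found")
```

Reading in chunks keeps memory use constant, which matters for files of 3 GB or more.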
                  All versions  This version
Views             28            28
Downloads         14            14
Data volume       47.9 GB       47.9 GB
Unique views      23            23
Unique downloads  6             6
