Dataset Open Access
A subset of the chat logs dump from Twitch used in this work is provided, to help replicate the central findings of this work (https://doi.org/10.5281/zenodo.1182793). Data are aggregated and include the number of messages posted in each channel and the number of users posting them, sampled at intervals of 5 minutes. To protect the identity of the users in this data collection, message contents and user names are not included in this dataset. Stream names have been replaced with numeric IDs.
No additional filtering or data cleaning operation has been applied to this data. Replication code is available on Github (https://github.com/glciampaglia/twitch-overload-replication).
Nematzadeh et al. (2016). arXiv:1610.06497 [cs.SI]