Dataset Open Access

Replication data for Nematzadeh et al. "Information Overload in Group Communication: From Conversation to Cacophony in the Twitch Chat"

Nematzadeh, Azadeh; Ciampaglia, Giovanni Luca; Ahn, Yong-Yeol; Flammini, Alessandro

A subset of the chat logs dump from Twitch used in this work is provided, to help replicate the central findings of this work (https://doi.org/10.5281/zenodo.1182793). Data are aggregated and include the number of messages posted in each channel and the number of users posting them, sampled at intervals of 5 minutes. To protect the identity of the users in this data collection, message contents and user names are not included in this dataset. Stream names have been replaced with numeric IDs. 
No additional filtering or data cleaning operation has been applied to this data. Replication code is available on Github (https://github.com/glciampaglia/twitch-overload-replication).

Files (1.8 GB)
Name Size
counts.h5
md5:7d2b9d9e8b5538d5da8e4dd5fe434065
1.5 GB Download
stats.h5
md5:b850c6227691e3473cce381204584c6a
217.9 MB Download
  • Nematzadeh et al. (2016). arXiv:1610.06497 [cs.SI]

78
23
views
downloads
All versions This version
Views 7878
Downloads 2323
Data volume 22.2 GB22.2 GB
Unique views 7070
Unique downloads 1515

Share

Cite as