Dataset Open Access

Replication data for Nematzadeh et al. "Information Overload in Group Communication: From Conversation to Cacophony in the Twitch Chat"

Nematzadeh, Azadeh; Ciampaglia, Giovanni Luca; Ahn, Yong-Yeol; Flammini, Alessandro

A subset of the chat logs dump from Twitch used in this work is provided, to help replicate the central findings of this work (https://doi.org/10.5281/zenodo.1182793). Data are aggregated and include the number of messages posted in each channel and the number of users posting them, sampled at intervals of 5 minutes. To protect the identity of the users in this data collection, message contents and user names are not included in this dataset. Stream names have been replaced with numeric IDs. 
No additional filtering or data cleaning operation has been applied to this data. Replication code is available on Github (https://github.com/glciampaglia/twitch-overload-replication).

Files (1.8 GB)
Name Size
counts.h5
md5:7d2b9d9e8b5538d5da8e4dd5fe434065
1.5 GB Download
stats.h5
md5:b850c6227691e3473cce381204584c6a
217.9 MB Download
  • Nematzadeh et al. (2016). arXiv:1610.06497 [cs.SI]

126
30
views
downloads
All versions This version
Views 126126
Downloads 3030
Data volume 29.0 GB29.0 GB
Unique views 117117
Unique downloads 2020

Share

Cite as