Dataset for the paper - Bots, Elections, and Controversies: Twitter Insights from Brazil's Polarised Elections
Description
Dataset for the paper "Diogo Pacheco. 2024. Bots, Elections, and Controversies: Twitter Insights
from Brazil’s Polarised Elections. In Proceedings of the ACM Web Conference
2024 (WWW ’24), May 13–17, 2024, Singapore, Singapore. ACM, New York,
NY, USA. https://doi.org/10.1145/3589334.3645651" (arXiv version - https://doi.org/10.48550/arXiv.2310.09051).
The dataset spans from August 30, 2018, to March 14, 2023. The period encompasses 1,657 days, and the collection process remained active for 94% of this time. This comprehensive effort resulted in the acquisition of a vast dataset comprising 437 million tweets originating from 13 million distinct accounts.
The dataset is composed of CSV files, grouped by year. Each record represents tweets and has: masked 'id_str', masked 'user.id_str', 'timestamp_ms', masked 'retweeted_status.id_str', masked 'quoted_status.id_str', masked 'in_reply_to_status_id_str', and 'botscore' (given by BotometerLite). An example file is given for easy reference.
For more details about the data collection, please refer to the paper.
Files
Files
(8.7 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:46c5518693db587ef5f24b4e53a1dd24
|
994.0 MB | Download |
|
md5:87ef51604aa37def2d4be47d50efc384
|
1.6 GB | Download |
|
md5:7a0c4f8b95a521d66cf61de7446736ae
|
1.9 GB | Download |
|
md5:77af5a43a58d11b47c3ab6245284fbd5
|
1.1 GB | Download |
|
md5:a204a4bd5d639876703437822ded6752
|
2.7 GB | Download |
|
md5:d61adb4b980aaca8eb0b18da417618d4
|
299.1 MB | Download |
|
md5:b5bfbd97575fe6595427908a89342c64
|
2.1 MB | Download |
Additional details
Related works
- Is part of
- Publication: 10.1145/3589334.3645651 (DOI)
- Publication: arXiv:2310.09051 (arXiv)