Published February 16, 2024 | Version v1
Dataset Open

Dataset for the paper - Bots, Elections, and Controversies: Twitter Insights from Brazil's Polarised Elections

  • 1. University of Exeter

Description

Dataset for the paper: Diogo Pacheco. 2024. Bots, Elections, and Controversies: Twitter Insights from Brazil’s Polarised Elections. In Proceedings of the ACM Web Conference 2024 (WWW ’24), May 13–17, 2024, Singapore, Singapore. ACM, New York, NY, USA. https://doi.org/10.1145/3589334.3645651 (arXiv version: https://doi.org/10.48550/arXiv.2310.09051).

The dataset spans August 30, 2018, to March 14, 2023, a period of 1,657 days; the collection process was active for 94% of this time. The collection yielded 437 million tweets from 13 million distinct accounts.
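The figures above can be checked with a few lines of arithmetic. The following is a minimal sketch, not part of the dataset: the variable names are illustrative, and the derived values (active days, tweets per account) are rough estimates because the 94% and the tweet and account counts are rounded in the description.

```python
from datetime import date

# Collection window as stated in the description.
start = date(2018, 8, 30)
end = date(2023, 3, 14)

# Number of days between the two dates.
span_days = (end - start).days  # 1657, matching the stated period

# Rough estimate of days with active collection; the 94% is a rounded
# figure, so treat this as approximate.
active_days_estimate = round(span_days * 0.94)

# Average tweets per account, from the rounded totals (about 33.6).
tweets_per_account = 437_000_000 / 13_000_000

print(span_days, active_days_estimate, round(tweets_per_account, 1))
```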

The dataset consists of CSV files, grouped by year. Each record represents one tweet and has the following fields: masked 'id_str', masked 'user.id_str', 'timestamp_ms', masked 'retweeted_status.id_str', masked 'quoted_status.id_str', masked 'in_reply_to_status_id_str', and 'botscore' (given by BotometerLite). An example file is provided for easy reference.
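As an illustration of how these fields might be read, here is a minimal sketch using only the Python standard library. Several things are assumptions rather than facts from the record: that each file has a header row with exactly these column names, that missing references appear as empty fields, that 'timestamp_ms' is a Unix epoch in milliseconds, and that 'botscore' parses as a float. The sample rows are synthetic placeholders, not real data, and the helper names `tweet_kind` and `read_records` are hypothetical. Check the example file for the actual layout.

```python
import csv
import io
from datetime import datetime, timezone

# Synthetic placeholder rows (NOT real data); the header is assumed to
# match the field names listed in the description.
SAMPLE = """id_str,user.id_str,timestamp_ms,retweeted_status.id_str,quoted_status.id_str,in_reply_to_status_id_str,botscore
a1,u1,1535587200000,,,,0.12
a2,u2,1535587260000,a1,,,0.87
a3,u1,1535587320000,,,a1,0.12
"""


def tweet_kind(row):
    """Classify a record by which reference field is filled.

    The precedence (retweet, then quote, then reply) is an assumption.
    """
    if row["retweeted_status.id_str"]:
        return "retweet"
    if row["quoted_status.id_str"]:
        return "quote"
    if row["in_reply_to_status_id_str"]:
        return "reply"
    return "original"


def read_records(handle):
    """Yield one parsed dict per CSV row from an open text handle."""
    for row in csv.DictReader(handle):
        yield {
            "id": row["id_str"],          # masked IDs are kept as strings
            "user": row["user.id_str"],
            # Assumed to be epoch milliseconds; converted to UTC.
            "time": datetime.fromtimestamp(
                int(row["timestamp_ms"]) / 1000, tz=timezone.utc
            ),
            "kind": tweet_kind(row),
            "botscore": float(row["botscore"]) if row["botscore"] else None,
        }


records = list(read_records(io.StringIO(SAMPLE)))
for rec in records:
    print(rec["id"], rec["kind"], rec["time"].isoformat(), rec["botscore"])
```

For a real file, the same `read_records` function could be given a handle from `open(path, newline="", encoding="utf-8")`. Given the file sizes, streaming rows this way avoids loading a whole year into memory.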

For more details about the data collection, please refer to the paper.

Files

Files (8.7 GB)

Checksum Size
md5:46c5518693db587ef5f24b4e53a1dd24 994.0 MB
md5:87ef51604aa37def2d4be47d50efc384 1.6 GB
md5:7a0c4f8b95a521d66cf61de7446736ae 1.9 GB
md5:77af5a43a58d11b47c3ab6245284fbd5 1.1 GB
md5:a204a4bd5d639876703437822ded6752 2.7 GB
md5:d61adb4b980aaca8eb0b18da417618d4 299.1 MB
md5:b5bfbd97575fe6595427908a89342c64 2.1 MB
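The MD5 checksums listed above can be used to confirm that a download completed intact. The following is a minimal sketch using Python's standard `hashlib`; the helper names `md5_of_file` and `matches_checksum` are hypothetical, and the file name in the usage comment is a placeholder, since the listing does not show file names.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in 1 MiB chunks so that
    multi-gigabyte files do not have to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, listed):
    """Compare a file against a listed checksum such as 'md5:46c5...'.

    The optional 'md5:' prefix is removed before comparing.
    """
    expected = listed.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected


# Usage (placeholder file name; substitute the file you downloaded):
# ok = matches_checksum("downloaded_file.csv",
#                       "md5:46c5518693db587ef5f24b4e53a1dd24")
```

A mismatch usually indicates a truncated or corrupted download, in which case downloading the file again is the simplest remedy.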

Additional details

Related works

Is part of
Publication: 10.1145/3589334.3645651 (DOI)
Publication: arXiv:2310.09051 (arXiv)