[Dataset] Tweets about COVID-19 Brazilian PCI
Description
Established in April 2021, the COVID-19 Parliamentary Commission of Inquiry (PCI) aimed to investigate omissions and irregularities committed by the federal government during the COVID-19 pandemic in Brazil. The pandemic killed more than 660,000 Brazilians, placing Brazil among the countries with the most deaths caused by COVID-19.
This dataset contains 3,397,933 tweets, split by day and by week, collected over a period of 26 weeks. It includes the text of the tweets, data about users (@ handle and profile description), and data about interactions between users. It can be used to improve text-cleaning techniques, toxic speech detection, and clustering, as well as for Social Network Analysis and social graph studies. The data is stored in Parquet format.
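As a rough illustration of the social graph use case mentioned above, the sketch below turns interaction records into weighted directed edges. The field names `user` and `mentioned_user` are hypothetical placeholders, since the dataset's actual column names are not listed here, and the toy records stand in for rows that would be read from the Parquet files.

```python
from collections import Counter
from typing import Iterable, Mapping, Optional


def build_weighted_edges(
    rows: Iterable[Mapping[str, Optional[str]]],
    source_key: str = "user",            # hypothetical column name
    target_key: str = "mentioned_user",  # hypothetical column name
) -> Counter:
    """Count directed interactions (source -> target) across tweet records."""
    edges: Counter = Counter()
    for row in rows:
        source = row.get(source_key)
        target = row.get(target_key)
        # Skip records without an interaction, and skip self-loops.
        if not source or not target or source == target:
            continue
        edges[(source, target)] += 1
    return edges


# Toy records standing in for rows loaded from the dataset.
sample_rows = [
    {"user": "alice", "mentioned_user": "bob"},
    {"user": "alice", "mentioned_user": "bob"},
    {"user": "bob", "mentioned_user": "carol"},
    {"user": "carol", "mentioned_user": None},
]
edges = build_weighted_edges(sample_rows)
```

The resulting `(source, target) -> count` mapping can be passed to a graph library as a weighted edge list. Adjust the two key names to match the real schema.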
This dataset accompanies a [paper](https://doi.org/10.1145/3539637.3556992) [1], published by the dataset's author, which carried out a social network analysis of the PCI topic to investigate evidence of political polarization. The source code and Jupyter notebooks are available on GitHub.
[1] Uniting Politics and Pandemic: a Social Network Analysis on the COVID Parliamentary Commission of Inquiry in Brazil. WebMedia 2022. Lucas Raniére J. Santos, Leandro B. Marinho, Claudio E. C. Campelo.
Files
Name | Size | Checksum |
---|---|---|
archive.zip | 5.4 GB | md5:89f64f180a7402848f0f5f5907df5df8 |
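Because the archive is large, it may be worth checking a download against the MD5 checksum listed for `archive.zip`. The following is a minimal sketch using only the Python standard library. The helper names `md5_of_file` and `verify_archive` and the local path are illustrative assumptions, not part of the dataset.

```python
import hashlib

# MD5 checksum taken from the file listing for archive.zip.
EXPECTED_MD5 = "89f64f180a7402848f0f5f5907df5df8"


def md5_of_file(path: str, chunk_size: int = 1024 * 1024) -> str:
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read until an empty bytes object signals end of file.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path: str, expected: str = EXPECTED_MD5) -> bool:
    """Return True if the file at `path` matches the expected MD5 checksum."""
    return md5_of_file(path) == expected.lower()
```

For example, `verify_archive("archive.zip")` should return `True` for an intact download, assuming the file was saved under that name in the current directory.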
Additional details
Related works
- Is published in
- Conference paper: 10.1145/3539637.3556992 (DOI)
References
- Uniting Politics and Pandemic: a Social Network Analysis on the COVID Parliamentary Commission of Inquiry in Brazil. WebMedia 2022. Lucas Raniére J. Santos, Leandro B. Marinho, Claudio E. C. Campelo.