Published November 7, 2022 | Version v1
Dataset Open

[Dataset] Tweets about COVID-19 Brazilian PCI

Description

Established in April 2021, the COVID-19 Parliamentary Commission of Inquiry (PCI) aimed to investigate omissions and irregularities committed by the federal government during the COVID-19 pandemic in Brazil. The pandemic resulted in the deaths of more than 660,000 Brazilians and placed Brazil among the countries with the most deaths caused by COVID-19.

This dataset contains 3,397,933 tweets, split by day and by week, collected over a period of 26 weeks. It includes textual data from tweets, data about users (@ handle and description), and data about interactions between users. It can be used to improve text-cleaning techniques, toxic speech detection, and clustering, as well as for Social Network Analysis and social graph studies. The data are stored in Parquet format.
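Since the data are distributed as Parquet files inside a zip archive, a first step is usually to see which Parquet files the archive holds and load one of them. The sketch below is a minimal, non-authoritative example: the internal layout of the archive and the column names are not documented in this record, so the code only discovers members ending in `.parquet` and makes no assumption about the schema. The function names are hypothetical, and loading a file assumes that pandas and a Parquet engine such as pyarrow are installed.

```python
import io
import zipfile


def list_parquet_members(archive_path):
    """Return the sorted names of all .parquet members in a zip archive."""
    with zipfile.ZipFile(archive_path) as archive:
        return sorted(
            name for name in archive.namelist()
            if name.lower().endswith(".parquet")
        )


def read_parquet_member(archive_path, member):
    """Load one Parquet member into a pandas DataFrame.

    Assumes pandas and a Parquet engine (e.g. pyarrow) are installed.
    """
    import pandas as pd  # imported lazily so listing works without pandas

    with zipfile.ZipFile(archive_path) as archive:
        # Read the member fully into memory: Parquet readers need a seekable file.
        buffer = io.BytesIO(archive.read(member))
    return pd.read_parquet(buffer)
```

For example, `members = list_parquet_members("archive.zip")` followed by `df = read_parquet_member("archive.zip", members[0])` loads the first file found. Printing `df.columns` is a reasonable way to learn the actual schema before any analysis.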

This dataset is part of a [paper](https://doi.org/10.1145/3539637.3556992)[1], published by the dataset's author, which conducted a social network analysis of the PCI topic to investigate evidence of political polarization. The source code and Jupyter notebooks are available on GitHub.

[1] Uniting Politics and Pandemic: a Social Network Analysis on the COVID Parliamentary Commission of Inquiry in Brazil. WebMedia 2022. Lucas Raniére J. Santos, Leandro B. Marinho, Claudio E. C. Campelo.

Files

archive.zip
Size: 5.4 GB
md5: 89f64f180a7402848f0f5f5907df5df8

Additional details

Related works

Is published in
Conference paper: 10.1145/3539637.3556992 (DOI)

References

  • Uniting Politics and Pandemic: a Social Network Analysis on the COVID Parliamentary Commission of Inquiry in Brazil. WebMedia 2022. Lucas Raniére J. Santos, Leandro B. Marinho, Claudio E. C. Campelo.