Planned intervention: On Wednesday April 3rd 05:30 UTC Zenodo will be unavailable for up to 2-10 minutes to perform a storage cluster upgrade.

There is a newer version of the record available.

Published July 24, 2022 | Version 3
Dataset Open

MonkeyPox2022Tweets: The First Public Twitter Dataset on the 2022 MonkeyPox Outbreak

  • 1. University of Cincinnati

Description

Please cite the following paper when using this dataset:

N. Thakur, “MonkeyPox2022Tweets: The first public Twitter dataset on the 2022 MonkeyPox outbreak,” Preprints, 2022, DOI: 10.20944/preprints202206.0172.v2

Abstract

The world is currently facing an outbreak of the monkeypox virus, and confirmed cases have been reported from 74 countries. Following a recent “emergency meeting”, the World Health Organization just declared monkeypox as a global health emergency. As a result, people from all over the world are using social media platforms, such as Twitter, for information seeking and sharing related to the outbreak, as well as for familiarizing themselves with the guidelines and protocols that are being recommended by various policy-making bodies to reduce the spread of the virus. This is resulting in the generation of tremendous amounts of Big Data related to such paradigms of social media behavior. Mining this Big Data and compiling it in the form of a dataset can serve a wide range of use-cases and applications such as analysis of public opinions, interests, views, perspectives, attitudes, and sentiment towards this outbreak. Therefore, this work presents MonkeyPox2022Tweets, an open-access dataset of Tweets related to the 2022 monkeypox outbreak that were posted on Twitter since the first detected case of this outbreak on May 7, 2022. The dataset is compliant with the privacy policy, developer agreement, and guidelines for content redistribution of Twitter, as well as with the FAIR principles (Findability, Accessibility, Interoperability, and Reusability) principles for scientific data management.

Data Description

The dataset consists of a total of 255,363 Tweet IDs of the same number of tweets about monkeypox that were posted on Twitter from 7th May 2022 to 23rd July 2022 (the most recent date at the time of dataset upload). The Tweet IDs are presented in 6 different .txt files based on the timelines of the associated tweets. The following provides the details of these dataset files.

  • Filename: TweetIDs_Part1.txt (No. of Tweet IDs: 13926, Date Range of the associated Tweet IDs: May 7, 2022 to May 21, 2022)
  • Filename: TweetIDs_Part2.txt (No. of Tweet IDs: 17705, Date Range of the associated Tweet IDs: May 21, 2022 to May 27, 2022)
  • Filename: TweetIDs_Part3.txt (No. of Tweet IDs: 17585, Date Range of the associated Tweet IDs: May 27, 2022 to June 5, 2022)
  •  Filename: TweetIDs_Part4.txt (No. of Tweet IDs: 19718, Date Range of the associated Tweet IDs: June 5, 2022 to June 11, 2022)
  • Filename: TweetIDs_Part5.txt (No. of Tweet IDs: 47718, Date Range of the associated Tweet IDs: June 12, 2022 to June 30, 2022)
  • Filename: TweetIDs_Part6.txt (No. of Tweet IDs: 138711, Date Range of the associated Tweet IDs: July 1, 2022 to July 23, 2022)

The dataset contains only Tweet IDs in compliance with the terms and conditions mentioned in the privacy policy, developer agreement, and guidelines for content redistribution of Twitter. The Tweet IDs need to be hydrated to be used. For hydrating this dataset the Hydrator application (link to download and a step-by-step tutorial on how to use Hydrator) may be used.

Files

TweetIDs_Part1.txt

Files (5.3 MB)

Name Size Download all
md5:c98bffbeb19fefd45dc52ed9930b8c9b
292.4 kB Preview Download
md5:e62dafa477203226052972b0db4711a1
371.8 kB Preview Download
md5:5b3e0e0376d71a3543f0f2dba42a51d0
369.3 kB Preview Download
md5:58ad47a1b0f0bd71917a0a9aee03f469
414.1 kB Preview Download
md5:856b3c527967c8981e833d64fe248b99
981.1 kB Preview Download
md5:1dddba86cb81476618aa98132bc8c6af
2.9 MB Preview Download