Published January 13, 2020 | Version v1
Dataset Open

A large-scale Twitter dataset for drug safety applications mined from publicly existing resources

  • 1. Georgia State University

Contributors

Contact person:

  • 1. Georgia State University

Description

This dataset consists of 1,181,993 Tweet Ids, obtained as a result from the paper - A large-scale Twitter dataset for drug safety applications mined from publicly existing resources.

The Tweet Ids can be hydrated using  twarc, a command line tool and Python library for archiving Twitter JSON data. Please follow these instructions to install twarc - https://github.com/DocNow/twarc

It will take less than 4 hours to hydrate the tweet ids using twarc. 

If you intend to use the dataset, please cite the paper. 

Files

acm_tweetids_drug-related.txt

Files (22.5 MB)

Name Size Download all
md5:90f2f14be2af3ea83da739aeb45ba6ae
22.5 MB Preview Download