Published December 14, 2022 | Version Version 2
Dataset Open

TweetDIS: A Large Twitter Dataset for Natural Disasters Built using Weak Supervision

  • 1. Georgia State University

Contributors

Supervisor:

  • 1. Georgia State University

Description

This repository contains the silver standard dataset and code for the paper "TweetDIS: A Large Twitter Dataset for Natural Disasters Built using Weak Supervision".

The file "heuristic_uniq_terms_nd.txt" contains the list of terms used as the heuristic and the file "natural_disasters_ssd_tweetids.tsv" contains the tweet ids in the silver standard dataset. 

To hydrate the tweets, you can use tools like twarc or Social Media Mining toolkit - https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7362951/

Files

heuristic_uniq_terms_nd.txt

Files (16.4 MB)

Name Size Download all
md5:e0bce2b399b2e1c9965def109f5722c7
2.5 kB Preview Download
md5:5eff12eb790b0f2f1b5b5fcf7c2ef5cf
16.4 MB Download