Published December 14, 2022
| Version Version 2
Dataset
Open
TweetDIS: A Large Twitter Dataset for Natural Disasters Built using Weak Supervision
Description
This repository contains the silver standard dataset and code for the paper "TweetDIS: A Large Twitter Dataset for Natural Disasters Built using Weak Supervision".
The file "heuristic_uniq_terms_nd.txt" contains the list of terms used as the heuristic and the file "natural_disasters_ssd_tweetids.tsv" contains the tweet ids in the silver standard dataset.
To hydrate the tweets, you can use tools like twarc or Social Media Mining toolkit - https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7362951/
Files
heuristic_uniq_terms_nd.txt
Files
(16.4 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:e0bce2b399b2e1c9965def109f5722c7
|
2.5 kB | Preview Download |
|
md5:5eff12eb790b0f2f1b5b5fcf7c2ef5cf
|
16.4 MB | Download |