Published September 16, 2025 | Version v1
Dataset Open

pre-monsoon storm database for northern Thailand 2015-2024

Contributors

Contact person:

Data collector:

  • 1. ROR icon Naresuan University

Description

*** pre-monsoon storm database for northern Thailand 2015-2024 ***

We compiled a pre-monsoon (summer) storm database for northern Thailand from publicly available reports. The primary source was the Department of Disaster Prevention and Mitigation (DDPM) website, which posts narrative incident summaries (location, date/time, damages, impacts). Additional official sources (provincial/district offices, utilities, etc.) were consulted when they posted incident information to Facebook or YouTube. To increase coverage, we also consulted credible community/volunteer public pages. Critically, the database stores only the source URLs as provenance. No text, photographs, or video files are downloaded, archived, or retained. All fields are coded by researchers from the public postings; media is referenced solely via links.

Event location (spatial footprint).
The minimum recorded unit is district (amphoe) and province; sub-district (tambon) is coded when reported. When official and non-official sources differ, official sources are prioritized. If only non-official sources exist or footprints conflict, we record the largest corroborated set of affected districts (upper bound), with at least two independent URLs when possible. All entries retain URL provenance, poster type (official/volunteer), and access date.

Damages and impacts.
Counts for affected houses, agriculture, livestock, and assets are taken from official sources when available. When multiple district-level figures exist, we retain the highest reported value as a conservative upper bound and flag uncertainty. We note remaining spatial/reporting uncertainty and plan iterative verification; only URLs are stored.

Hail occurrence and size.
DDPM posts indicating hail presence/absence are treated as primary evidence. Volunteer posts (linked by URL) are used to corroborate and geolocate events. Text-only size descriptions (e.g., fruit/body-part analogies) are coded as low-confidence; Thai coin references are mapped to standard diameters to enable quantitative use. When conflicting sizes appear across sources, we record the largest corroborated size and retain all URLs. We do not store media; we only record whether a post indicates photo/video evidence.

Event timing.
Official timestamps are prioritized. If multiple sources disagree, we define a time window from the earliest reported start to the latest reported end. Vague reports (“afternoon,” “evening”) are coded into coarse bins. Non-contiguous episodes on the same day (e.g., late afternoon and nighttime) are split into separate events after cross-checking URLs, consistent with organized convective line (squall line) passage.

Governance and ethics. All data are from public web pages; we store URLs only, no personal identifiers, and we restrict location to administrative units (district/sub-district). Fields include URL, platform, poster type, post date (as displayed), access date, evidence type (text/photo/video indicated), and confidence flags. This conservative, rules-based approach supports transparency, reproducibility, and future QC while respecting content ownership and platform policies.

Files

00_PP1_demonstration_database_usage_googleColab.ipynb

Files (729.0 kB)

Name Size Download all
md5:83276597d21694d46378f4776566cff3
68.4 kB Preview Download
md5:2183cd79f8d75a346dabecd15790b963
660.6 kB Preview Download

Additional details

Additional titles

Subtitle (Thai)
pre-monsoon storm database events for northern Thailand