Datasets used in "Evidential Deep Learning: Enhancing Predictive Uncertainty Estimation for Earth System Science Applications"
Creators
- 1. NCAR
- 2. Cooperative Institute for Severe and High-Impact Weather Research and Operations, National Severe Storms Laboratory, Norman, OK, USA
- 3. CU Boulder
- 4. UCAR
- 5. University of Maryland
- 6. SUNY Albany
- 7. Colorado State University
Description
The precipitation type (p-type) dataset (ptype.parquet) comprises observational weather reports sourced from the Meteorological Phenomena Identification Near the Ground (mPING) project, combined with corresponding numerical weather prediction data from the NOAA Rapid Refresh (RAP) model. These crowd-sourced mPING reports offer precipitation type labels (rain, snow, sleet, and freezing rain) across North America, while the RAP model provides atmospheric data, including temperature, humidity, and wind profiles, on pressure levels.
The RAP data covers the contiguous United States (CONUS) from 2015 to 2022 on an hourly 13km grid. The mPING observations are matched to the nearest RAP grid cell and hour, allowing the two data sources to be merged into a labeled dataset suitable for classification tasks.
The surface layer flux dataset (surface_layer.csv) contains high-frequency meteorological observations spanning from 2013 to 2015, collected at the Cabauw Experimental Site in the Netherlands. It includes measurements of various variables such as temperature, humidity, wind, radiation, and soil moisture, recorded every 10 minutes. The target output encompasses friction velocity, sensible heat, and latent heat.
The code used for processing the datasets and training neural network models is available in the Miles-Guess repository (https://github.com/ai2es/miles-guess).
Files
surface_layer.csv
Files
(5.1 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:dae508d3d7b379337e5d86c49ce2419b
|
4.0 GB | Download |
|
md5:0706c8ecc59b18ed2fb48679dc533e96
|
1.1 GB | Preview Download |