EngRAD: Multivariate Weather Measurements Across England
Description
EngRAD Dataset
The EngRAD dataset contains measurements of 5 different weather variables collected at 487 grid points in England from 2018 to 2020.
Dataset Overview
Data has been provided by Open-Meteo and licensed under Attribution 4.0 International (CC BY 4.0). The numerical weather prediction model used to generate the data is ECMWF IFS, which has a spatial resolution of 9 km. The grid points are located in correspondence with cities. Each point is associated with location information such as geographic coordinates, elevation, closest city, and the county to which it belongs. The physical variables collected are:
- Air temperature at 2 meters above ground (°C)
- Relative humidity at 2 meters above ground (%)
- Summation of total precipitation (rain, showers, snow) during the preceding hour (mm)
- Total cloud cover (%)
- Global horizontal irradiance (W/m²)
These variables are typically of interest in applications related to solar radiation, such as solar power production.
Channels:
The data.h5
file contains two tables, accessible by the following keys:
-
data
: Contains the measurements for each point across different weather variables.temperature_2m
: Air temperature at 2 meters above ground (°C). Instantaneous measurement.relative_humidity_2m
: Relative humidity at 2 meters above ground (%). Instantaneous measurement.precipitation
: Total precipitation (rain, showers, snow) sum of the preceding hour (mm). Preceding hour sum.cloud_cover
: Total cloud cover as an area fraction (%). Instantaneous measurement.shortwave_radiation
: Global horizontal irradiation (GHI) (W/m²). Preceding hour mean.
-
metadata
: Contains the following detailed information for each point:city
: The name of the city where the measurement point is located.county
: The county in which the city is situated.admin_name
: The administrative name associated with the city or region.lat
: The latitude coordinate of the measurement point.lon
: The longitude coordinate of the measurement point.elevation
: The elevation (in meters) above sea level at the measurement point.population
: The population of the city where the measurement point is located.
Dataset size
- Time steps: 26304
- Points: 487
- Channels: 5
- Sampling rate: 1 hour
- Missing values: 0.00%
Credits
This dataset has been introduced in the paper:
Ivan Marisca, Cesare Alippi, and Filippo Maria Bianchi. "Graph-based forecasting with missing data through spatiotemporal downsampling." Proceedings of the 41st International Conference on Machine Learning, PMLR 235:34846-34865, 2024.
Please consider citing the paper if you use the dataset for your research.
@inproceedings{marisca2024graph,
title = {Graph-based Forecasting with Missing Data through Spatiotemporal Downsampling},
author = {Marisca, Ivan and Alippi, Cesare and Bianchi, Filippo Maria},
booktitle = {Proceedings of the 41st International Conference on Machine Learning},
pages = {34846--34865},
year = {2024},
volume = {235},
series = {Proceedings of Machine Learning Research},
publisher = {PMLR}
}
Files
Files
(111.1 MB)
Name | Size | Download all |
---|---|---|
md5:e311f7587447154859ce361eea51409a
|
111.1 MB | Download |
Additional details
Software
- Repository URL
- https://github.com/marshka/hdtts
- Programming language
- Python
- Development Status
- Active