There is a newer version of the record available.

Published September 6, 2022 | Version v4
Dataset Open

Weather prediction dataset

  • 1. Centre for Digitalization and Digitality, University of Applied Sciences Düsseldorf
  • 2. Netherlands eScience Center
  • 3. Helmholtz AI
  • 4. Department of Computer Science, Aberystwyth University

Description

Dataset created for machine learning and deep learning training and teaching purposes.
It can be used, for instance, for classification, regression, and forecasting tasks.
It is complex enough to demonstrate realistic issues such as overfitting and unbalanced data, while remaining intuitively accessible.

ORIGINAL DATA TAKEN FROM:

EUROPEAN CLIMATE ASSESSMENT & DATASET (ECA&D), file created on 22-04-2021
THESE DATA CAN BE USED FREELY PROVIDED THAT THE FOLLOWING SOURCE IS ACKNOWLEDGED:

Klein Tank, A.M.G. and Coauthors, 2002. Daily dataset of 20th-century surface
air temperature and precipitation series for the European Climate Assessment.
Int. J. of Climatol., 22, 1441-1453.
Data and metadata available at http://www.ecad.eu

For more information, see the metadata.txt file.
The dataset has also been presented at the Teaching Machine Learning Workshop at ECML 2022: https://teaching-ml.github.io/2022/.

The Python code used to create the weather prediction dataset from the ECA&D data can be found on GitHub: https://github.com/florian-huber/weather_prediction_dataset
This repository also contains Jupyter notebooks with teaching examples.

Versions:

  • v4: to be more future-proof in times of climate change/crisis, "BBQ weather" prediction is now "picnic weather" prediction. The data itself remains unchanged.
  • v3: added a "light" version of the dataset with fewer features (only 11 locations and fewer variables, a reduction from 163 to 89 features). This is meant to be used if training times become an issue in hands-on sessions.
  • v2: now also contains additional `BBQ_weather` labels; the dataset itself has not changed between versions v1 and v2.

Files (5.0 MB)

Checksum                                Size
md5:469f459dd7aa7b10a131d59ff1aefc01    4.1 kB
md5:94cf8d8d1f6233ebde011d6062b1af5d    2.8 MB
md5:ae96c3912f24caa097867e9c4da034d8    1.5 MB
md5:40114391d126ec09993b41447d101038    337.8 kB
md5:e9d63bdd7522d91846de24a34b3e7f98    394.3 kB