Dataset Open Access

KDD Cup Dataset (with Missing Values)

Godahewa, Rakshitha; Bergmeir, Christoph; Webb, Geoff; Hyndman, Rob; Montero-Manso, Pablo

This dataset was used in the KDD Cup 2018 forecasting competition. It contains long hourly time series representing the air quality levels in 59 stations in 2 cities: Beijing (35 stations) and London (24 stations) from 01/01/2017 to 31/03/2018. The air quality level is represented in multiple measurements such as PM2.5, PM10, NO2, CO, O3 and SO2.

The dataset uploaded here contains 270 hourly time series which have been categorized using city, station name and air quality measurement.

Files (2.5 MB)
Name Size
2.5 MB Download
  • Kdd cup 2018. URL

All versions This version
Views 576139
Downloads 291135
Data volume 1.7 GB331.7 MB
Unique views 465119
Unique downloads 251111


Cite as