Published November 4, 2025
| Version v1
Dataset
Open
PreDist Dataset - Operational data of district heating substations labelled with faults and maintenance information
Authors/Creators
Description
This dataset consists of operational data and labels based on incident reports and maintenance data of district heating substations of enercity Netz GmbH. The labels are available as a list of ‘disturbances’ as well as a list of fault reports including a short description and problem category, which can be used to develop (early) fault detection models for district heating substations. In addition, fault labels and monitoring potential were added to the reports where possible. The dataset is published together with the paper "Enabling Predictive Maintenance in District Heating Substations: A Labelled Dataset and Fault Detection Evaluation Framework based on Service Data", which explains the dataset in detail. When referring to this dataset, please cite the paper mentioned in the related work section.
The PreDist dataset contains time series of 93 district heating substations from two manufacturers, M1 and M2, each time series spanning different lengths of time, depending on when the substation was ‘digitised’. Both sub-datasets contain a list of faults (based on incident reports), a list of disturbances (incident reports, and corrective and preventive maintenance tasks and activities), feature descriptions and a list of pre-defined ‘normal events’, which can be used in addition to the faults to evaluate normal behaviour models.
Files
predist_dataset.zip
Files
(266.8 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:a99bf6aedaffd9a4b27ad88959130d20
|
266.8 MB | Preview Download |
|
md5:328e77a9cbe3a3b7626c5c9bc8d111d5
|
8.9 kB | Preview Download |
Additional details
Related works
- Is described by
- Journal article: arXiv:2511.14791 (arXiv)
Funding
- Federal Ministry for Economic Affairs and Climate Action
- PreDist - Prädikative Wartung und Instandhaltung von HAST als Teil eines Fernwärmesystems mit Hilfe von Grey-Box-Verfahren; Teilprojekt: Erarbeitung und Ausbau der Machine-Learning-Fähigkeiten 03EN3082