Info: Zenodo’s user support line is staffed on regular business days between Dec 23 and Jan 5. Response times may be slightly longer than normal.

Published August 25, 2022 | Version 2.1.0
Dataset Open

Thermal Bridges on Building Rooftops - Hyperspectral (RGB + Thermal + Height) drone images of Karlsruhe, Germany, with thermal bridge annotations

  • 1. Karlsruhe Institute of Technology, Germany
  • 2. Helmholtz AI, Karlsruhe Institute of Technology, Germany
  • 3. University of Southern California, USA
  • 4. Helmholtz Metadata Collaboration, Karlsruhe Institute of Technology, Germany

Description

Overview:

The dataset of Thermal Bridges on Building Rooftops (TBBR dataset) consists of annotated combined RGB and thermal drone images with a height map. All images were converted to a uniform format of 3000x4000 pixels, aligned, and cropped to 2680x3370 to remove empty borders. See the "Usage" section below for details about the stored formats made available here.

The raw images for our dataset were recorded with a normal (RGB) and a FLIR-XT2 (thermal) camera on a DJI M600 drone. They show six large building blocks of around 20 buildings per block recorded in the city centre of the German city Karlsruhe east of the market square. Because of a high overlap rate of the images, the same buildings are on average recorded from different angles in different images about 20 times.

All images were recorded during a drone flight on March 19, 2019 from 7 a.m. to 8 a.m. At this time, temperatures were between 3.78 ° C and 4.97 ° C, humidity between 80% and 98%. There was no rain on the day of the flight, but there was 2.3mm/m² 48 hours beforehand. For recording the thermographic images an emissivity of 1.0 was set. The global radiation during this period was between 38.59 W / m² and 120.86 W / m². No direct sunlight can be seen visually on any of the recordings.

The dataset contains 926 images with a total of 6,927 annotations of thermal bridges on rooftops, split into train and test subsets with 723 (5,614) and 203 (1,313) images (annotations), respectively. The annotations only include thermal bridges that are visually identifiable with the human eye. Because of the aforementioned image overlap, each thermal bridge is annotated multiple times from different angles.

For the annotation of the thermal images the image processing program VGG Image Annotator from the Visual Geometry Group, version 2.0.10, was used. The thermal bridge annotations are outlined with polygon shapes. These polygon lines were placed as close as possible but outside the area of significant temperature increase. If a detected thermal bridge was partially covered by another building component located in the foreground, the thermal bridge was also marked across the covering in case of minor coverings. Adjacent thermal bridges, which affect different rooftop components, were annotated separately. For example, a window with poor insulation of the window reveal located in the area of a poorly insulated roof is annotated individually. There is no overlap between annotated areas. While each image contains annotations, they also include thermal bridges present that are not annotated.

Usage:

Each compressed archive file represents one of the six flight paths. For the related publication the final path (Flug1_105Media) was used as a hold-out test sample. The archives contain Numpy files (one per image) of shape (2680, 3370, 5), where the final dimension is the colour channel in the format [B, G, R, Thermal, Height].

Archives were compressed using ZStandard compression. They can be decompressed in a terminal by running e.g.

tar -I zstd -xvf Flug1_105Media.tar.zst

these will be decompressed into the file structure:

images/
└── Flug1_105Media/
    └── DJI_0004_R.npy
    └── DJI_0006_R.npy
    └── ...

Corresponding annotations are provided in the COCO JSON format. There is one file for training (Flug1_100Media - Flug1_104Media blocks) and one for test (Flug1_105Media block). They contain a single class (thermal bridge) and expect the folder structure shown below.

Note: The annotation files contain relative paths to numpy files, in case of problems please convert to absolute paths (i.e. insert the containing directory before each file path in the JSON annotation files).

We provide the TBBRDet software which includes a dataloader and dataset inspection tools which make use of the Detectron2 and MMDetection libraries.

We recommend the following folder structure for use:

├── train/
│   ├── Flug1_100-104Media_coco.json
│   └── images/
│       ├── Flug1_100Media/
│       │   ├── DJI_XXXX_R.npy
│       │   └── ...
│       ├── ...
│       └── Flug1_104Media/
│           ├── DJI_XXXX_R.npy
│           └── ...
└── test/
    ├── Flug1_105Media_coco.json
    └── images/
        └── Flug1_105Media/
            ├── DJI_XXXX_R.npy
            └── ...

Metadata:

The experimental metadata was structured with the Spatio Temporal Asset Catalog (STAC) specification family. This specification provides a standardized way for describing geospatial assets. It defines related JSON object types of Item, Catalog, and Catalog, extending on Collection as the basis.

One STAC Collection JSON object provides information about the recorded images and the environmental conditions during recordings. It also contains information about the overall bounding box of the entire area in which images were recorded.

This object links to related STAC Item JSON objects containing information about the recorded city blocks and the cameras. The objects for the city blocks contain the GeoJSON geometry of the respective block and the
corresponding bounding box. The objects containing the camera information are based on an existing STAC extension for camera related metadata.

Metadata of the archived NumPy files for each image was structured using the Data Package schema from the Frictionless Standards. This standard describes a collection of data files. Therefore, metadata about all containerized NumPy files of the six flight paths (Flug1_100Media - Flug1_104Media blocks and Flug1_105Media block) is provided within a JSON-based file.

Note that camera1 corresponds to the RGB camera and camera2 the thermal.

FAIR Digital Objects:

All files are represented in a standardized way as FAIR Digital Objects
(FAIR DOs)
to enable machine actionable decisions on the data in spirit of
the FAIR principles.

Persistent Identifier (PID):

Persistent Identifiers (PIDs) are resolvable with the Handle.Net Registry (HNR).

File Persistent Identifier (PID)
Flug1_100-104Media_coco.json 21.11152/6ea60288-d895-414e-80c0-26c9fdd662b2
Flug1_105Media_coco.json 21.11152/58d43ddc-5e29-4980-8675-ae579b50a1e2
Flug1_100.tar.zst 21.11152/6858a0b5-cc60-40e9-afef-8c2dd8b35e8e
Flug1_101.tar.zst 21.11152/e670f510-7e00-4d3a-9b90-3bac7a7c069e
Flug1_102.tar.zst 21.11152/3ab9f444-05f6-445e-a691-62fae4021bea
Flug1_103.tar.zst 21.11152/365fd8cf-8e86-41b8-9d0e-b816fdd01d29
Flug1_104.tar.zst 21.11152/041a6111-644a-4617-afb3-3c421a88e8e3
Flug1_105.tar.zst 21.11152/f48bf4e7-3879-4216-8f64-45a060b8f658
Flug1_100-105_frictionless_standards.json 21.11152/7b58b3b5-75eb-4417-ac4d-abe025e159f6
Flug1_collection_stac_spec.json 21.11152/ba370aa3-6422-428c-9ff7-c2ef429df603
Flug1_100_stac_spec.json 21.11152/09cb76fc-b8cb-4116-a22a-68c5bdfa77b0
Flug1_101_stac_spec.json 21.11152/24a55398-b96b-43dd-b0fb-cd8ce302c7ce
Flug1_102_stac_spec.json 21.11152/721234ac-4b5a-4d02-9944-82a08ef2db35
Flug1_103_stac_spec.json 21.11152/ebaeb5bc-0514-47c9-bcd2-98f0253843d8
Flug1_104_stac_spec.json 21.11152/9854677c-77c5-4a0b-916b-57dd9ec20198
Flug1_105_stac_spec.json 21.11152/cfd0fc0e-f5ea-464e-a57f-28e882924860
Flug1_camera1_stac-spec.json 21.11152/976fcf28-f924-4a21-b53d-5d054ad8198d
Flug1_camera2_stac-spec.json 21.11152/37833c54-1d36-42e4-858d-831447122863

Files

Flug1_100-104Media_coco.json

Files (68.5 GB)

Name Size Download all
md5:f133dd7d1faacf15a434e2e8ece0d57b
1.7 MB Preview Download
md5:9bb8b601a54a437dde24caf81e2821e7
1.1 kB Preview Download
md5:716acce83a51ad2fc958ab3ce0026f71
6.0 GB Download
md5:68aefd5db22d527419b5fb5859794f87
1.5 kB Preview Download
md5:57219d1c2d8917a85c1ea32459f61a84
13.9 GB Download
md5:e4a9a00cb675cf3c6c7b9b1db6828930
1.6 kB Preview Download
md5:9df9c0b8e48a98f2ae4efa19ec62d996
13.7 GB Download
md5:b7c682415affe8d8fbf2cfbd41d9aaec
1.6 kB Preview Download
md5:8aca44b36e15e61478debffdb9edda18
14.3 GB Download
md5:cd71b80aecb83052a20e6beeeca74b59
1.5 kB Preview Download
md5:850f11548064ae5c8ef7e376de54e7cf
14.7 GB Download
md5:2eb84e5f8de0dae9e122ac4c1dd0b3aa
1.5 kB Preview Download
md5:aa1fb7275480142c04070859b8a2b5a5
5.9 GB Download
md5:91fb379019bd48cf98e020fc584cc6a2
1.5 kB Preview Download
md5:5a4732a6ce1aa27064569f6248ed2a9c
361.2 kB Preview Download
md5:4afe74b909027218d51eabd4687ba3d3
907 Bytes Preview Download
md5:41351dd87d3884aa8d7643971a1a1ddb
936 Bytes Preview Download
md5:01cd960bfa4d927bd96b813c4dcdbb57
3.1 kB Preview Download

Additional details

Related works

Is supplement to
Conference paper: 10.5445/IR/1000136256 (DOI)
Journal article: 10.1016/j.autcon.2022.104690 (DOI)
Is supplemented by
Dataset: 10.5281/zenodo.7351622 (DOI)
References
Software: https://github.com/Helmholtz-AI-Energy/TBBRDet (URL)