Published October 24, 2020 | Version 1.0.0
Dataset Open

SDT Dataset | Sdt: A Synthetic Multi-Modal Dataset For Person Detection And Pose Classification

  • 1. TU Wien, Computer Vision Lab

Description

The Synthetic Depth & Thermal (SDT) dataset consists of 40k synthetic and 8k real depth and thermal stereo images depicting human behavior in indoor environments. The samples show uniquely posed lying, sitting, and standing persons in four room types (living room, bedroom, bathroom, and kitchen), recorded from an elevated position. A fourth control class of empty rooms is also provided. Both the synthetic and the real part of SDT are balanced across these four classes and four room types. The synthetic part is intended as training (and validation) data for uni- or multi-modal pose classification or person detection models, while the real part can be used to assess generalization performance. To facilitate supervised training, pose labels and person bounding boxes are provided for all images. The real images were captured with a multi-modal stereo camera system consisting of an Orbbec Astra depth camera and a FLIR Lepton 3.5 thermal camera. The synthetic images, which share the image characteristics of these cameras, were produced by rendering virtual 3D scenes in Blender and then adding camera-specific noise.

Download and Use
This data may be used for non-commercial research purposes only. If you publish material based on this data, we request that you include a reference to our paper [1].

[1] C. Pramerdorfer, J. Strohmayer and M. Kampel, "Sdt: A Synthetic Multi-Modal Dataset For Person Detection And Pose Classification," 2020 IEEE International Conference on Image Processing (ICIP), Abu Dhabi, United Arab Emirates, 2020, pp. 1611-1615, doi: 10.1109/ICIP40778.2020.9191284.

BibTeX citation:

@INPROCEEDINGS{9191284,
  author={Pramerdorfer, C. and Strohmayer, J. and Kampel, M.},
  booktitle={2020 IEEE International Conference on Image Processing (ICIP)},
  title={Sdt: A Synthetic Multi-Modal Dataset For Person Detection And Pose Classification},
  year={2020},
  pages={1611-1615},
  doi={10.1109/ICIP40778.2020.9191284}
}

Files

camera.zip

Files (19.2 GB)

Checksum (MD5)                        Size
md5:21a5e06a8487589c89a2f1595fbe7a15  14.0 kB
md5:543ae9b4e29466b6e5b9be5ca44b22bc  3.4 kB
md5:d59a739f3b5ecf373c94046fb94cd94f  1.7 GB
md5:a7dfe81a1db58219da14db966d75cb2e  4.2 GB
md5:5e56bf4c17a2ce2f4b5cb59881dd161e  4.2 GB
md5:5f4c46025a46139db311382aa709a3a1  4.2 GB
md5:fbc26a3785540ff269410cdc43d53eae  1.4 GB
md5:5464368b4798b50c59de3e06599b2677  3.5 GB
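
Since several of the archives are multiple gigabytes in size, it can be worth checking a download against the MD5 checksums listed above before extracting it. The following is a minimal sketch, not part of the dataset or its tooling: the helper names `md5_of_file` and `matches_checksum` are hypothetical, and it assumes the expected value is given in the `md5:<hex>` form used in the file listing.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in fixed-size chunks.

    Chunked reading keeps memory use low for multi-gigabyte archives.
    """
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops once read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file's MD5 with an expected value such as 'md5:21a5e0...'.

    The optional 'md5:' prefix and letter case are ignored (hypothetical helper).
    """
    expected_hex = expected.strip().lower()
    if expected_hex.startswith("md5:"):
        expected_hex = expected_hex[len("md5:"):]
    return md5_of_file(path) == expected_hex
```

To use it, call `matches_checksum` with the local path of a downloaded file and the checksum shown next to that file on the download page; a result of `False` suggests an incomplete or corrupted download.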

Additional details