Multimodal Object Detection Dataset for Car and Person Detection - Part 1
Description
This multimodal dataset was acquired using a drone equipped with a DJI Zenmuse H20T camera to monitor a parking area by navigating through 5 predefined waypoints. The drone flew at an altitude of 60 meters and hovered for 30 seconds at each waypoint to record video with three sensors: the RGB wide lens, the RGB zoom lens (at a minimum 2× zoom, simulating a 30-meter altitude), and the thermal lens. Individual frames were then extracted from the recorded videos. The dataset is organized into separate ZIP archives, one for each combination of waypoint and camera lens. Each archive is identified by the waypoint number together with the sensor type: 'W' for wide, 'Z' for zoom, and 'T' for thermal. Frame-level annotations are also included and are organized in the same way.
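The waypoint and sensor scheme described above implies 5 × 3 = 15 waypoint/sensor combinations. The following is a minimal sketch for enumerating them. The exact identifier format is not given in the description, so the `template` argument, the function name `archive_ids`, and the assumption that waypoints are numbered from 1 are all hypothetical.

```python
from itertools import product

# Five predefined waypoints; numbering from 1 is an assumption.
WAYPOINTS = range(1, 6)

# Sensor codes as stated in the description.
SENSORS = {"W": "wide", "Z": "zoom", "T": "thermal"}


def archive_ids(template="{waypoint}{sensor}"):
    """Return one identifier per (waypoint, sensor) pair.

    The default template is a placeholder: the description only says that
    identifiers combine the waypoint number with the sensor letter.
    """
    return [
        template.format(waypoint=w, sensor=s)
        for w, s in product(WAYPOINTS, SENSORS)
    ]


if __name__ == "__main__":
    for identifier in archive_ids():
        print(identifier)
```

Passing a different `template` adapts the sketch to whatever naming the downloaded archives actually use.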
Files
(44.0 GB)
| MD5 checksum | Size |
|---|---|
| f117a424be0b8a05d784f34ba196f6dd | 1.4 GB |
| fcbfa020e11b2e61b72a13a906a0f641 | 17.2 GB |
| e93938e8b869aa77d2ce4aafb0d36473 | 25.3 GB |
| 1caaff5a31c14941060559b4eb721561 | 13.4 MB |
| 1aca2e5936a0ebedc75bdf6b742d7cce | 5.8 MB |
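Because the archives are large, it can be worth checking a download against its listed MD5 checksum before unpacking it. The following is a minimal sketch using Python's standard `hashlib` module. The helper names `md5_of_file` and `matches_checksum` are illustrative, and the file path in the usage example is a placeholder, since the listing does not show which file name belongs to which checksum.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks
    so that multi-gigabyte archives do not need to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file's MD5 digest with an expected value.

    Accepts either a bare hex digest or the 'md5:<hex>' form used in the
    file listing.
    """
    expected_hex = expected.strip().lower().split(":")[-1]
    return md5_of_file(path) == expected_hex


if __name__ == "__main__":
    # Placeholder path: substitute the archive that was actually downloaded
    # and the checksum listed next to it.
    ok = matches_checksum(
        "downloaded_archive.zip",
        "md5:f117a424be0b8a05d784f34ba196f6dd",
    )
    print("checksum OK" if ok else "checksum mismatch")
```

A mismatch usually indicates an incomplete or corrupted download, in which case the archive should be fetched again.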