Published December 18, 2023 | Version 1
Dataset Open

Fusion of Underwater Camera and Multibeam Sonar for Diver Detection and Tracking

Description

Context
 
This dataset is related to previously published public dataset "Sonar-to-RGB Image Translation for Diver Monitoring in Poor Visibility Environments". https://zenodo.org/records/7728089
It contains ZED-right camera and sonar images collected from Hemmoor Lake and DFKI Maritime Exploration Hall.
 
Sensors: Low Frq (1.2MHz) Blueprint Oculus M1200d Sonar and ZED Right Camera
 
Content
 
The dataset is created for Diver Detection and Diver Tracking applications.
 
For the Diver Detection part, the dataset is prepared to train, validate and test YOLOv7 model.
7095 images are used for training data, and 3095 images are used for validation data. These sets are augmented from originally captured and sampled ZED camera images. Augmentation methods are not applied to the Test data, which contains 822 images. Train and validation contain images from both the DFKI pool and Hemmor Lake, while the test data is only collected from the lake.
 
To distinguish between the original image and the augmented image, check the name coding. 
Naming of object detection images:
original_image_name.jpg
if augmented:
original_image_name_<augmentation_number_of_the_same_image>.jpg
 
Object Detection Label Format: 
YOLO [(class), ((x_min + (x_max - x_min)/2)  / image_width), ((y_min + (y_max - y_min)/2)  / image_height), ((x_max - x_min) / image_width), ((y_max - y_min) / image_height)]
 
Class: "diver", represented by "0" in object detection labels.
 
Resolution of Object Detection Camera Images: 640x640
Resolution of Object Tracking Camera Images: 1280x720
Resolution of Object Tracking Low Frequency Sonar: 932x514
 
About the Object Tracking on Sonar, the sampled data is the part where diver moves around the table and the platform. 
There are 4 cases shared in the dataset, which contain a sonar stream, and corresponding ZED-right camera images. 
Totally, 1193 points represent the diver on sonar images for the diver tracking application.
 
For the tracking, "tracking_sonar_coordinates_<number>.csv" contains x,y coordinates of a point where the diver is in the sonar image. 
And "image_sonar_<number>.csv" file contains the matching between sonar and camera images.
 
Acknowledgements
 
The data in this repository were collected as a joint effort between the German Center for Artificial Intelligence (DFKI), the German Federal Agency for technical Relief (THW), and Kraken Robotics GmbH. This work is part of the project DeeperSense that received funding from the European Commission. Program H2020-ICT-2020-2 ICT-47-2020 Project Number: 101016958.

 

Files

Diver Detection - Camera.zip

Files (641.1 MB)

Name Size Download all
md5:c381f392a8a040204a5160a0f34d580c
361.4 MB Preview Download
md5:b31e0b43806937f76933c9bc31ba75db
279.7 MB Preview Download

Additional details

Related works

References
Dataset: 10.5281/zenodo.7728088. (DOI)

Funding

European Commission
DeeperSense - Deep-Learning for Multimodal Sensor Fusion 101016958

Dates

Submitted
2023-12