Published June 11, 2025 | Version v.1.0.0
Dataset Open

Sea-Undistort: A Synthetic Dataset for Restoring Through-Water Images in Airborne Bathymetric Mapping

  • 1. ROR icon Technische Universität Berlin
  • 2. ROR icon Berlin Institute for the Foundations of Learning and Data

Description

The dataset

To address the absence of real-world paired imagery with and without wave- and water-induced distortions, we introduce Sea-Undistort, a synthetic dataset created using the open-source 3D graphics platform Blender. The dataset comprises 1200 image pairs, each consisting of 512×512 pixel RGB renderings of shallow underwater scenes. Every pair includes a “non-distorted” image, representing minimal surface and column distortions, and a corresponding “distorted” version that incorporates realistic optical phenomena such as sun glint, wave-induced deformations, turbidity, and light scattering. These effects are procedurally generated to replicate the diverse challenges encountered in through-water imaging for bathymetry. The scenes are designed with randomized combinations of typical shallow-water seabed types, including rocky outcrops, sandy flats, gravel beds, and seagrass patches, capturing a wide range of textures, reflectance patterns, and radiometric conditions. Refraction is accurately modeled in both the distorted and non-distorted images to maintain geometric consistency with real underwater imaging physics.

In addition, camera settings are uniformly sampled within specific ranges to ensure diverse imaging conditions. Sensor characteristics include a physical width of 36 mm and effective pixel widths of 4000 or 5472 pixels. Focal lengths of 20 mm and 24 mm are simulated with only the central 512x512 pixels rendered. Camera altitude ranges from 30 m to 200 m, resulting in a ground sampling distance (GSD) between 0.014 m and 0.063 m. Average depths range from –0.5 m to –8 m, with a maximum tilt angle of 5°. Sun elevation angles between 25° and 70°, along with varying atmospheric parameters (e.g., air, dust), are used to simulate different illumination conditions. Generated images are accompanied by a .json file containing this metadata per image. 

Sea-Undistort is designed to support supervised training of deep learning models for through-water image enhancement and correction, enabling generalization to real-world conditions where undistorted ground truth is otherwise unobtainable.

Citation

If you use the dataset, please cite:

 

Acknowledgment
This work was part of the project MagicBathy which is a research project funded by the European Commission for the period 2023-2025. It is funded under the HORIZON Europe MSCA Postdoctoral Fellowships - European Fellowships (GA 101063294).

Technical info

Folder structure
 
┗ 📂 Sea-Undistort/
  ┣ 📜 render_0000_ground.png
  ┣ 📜 render_0000_no_sunglint.png
  ┣ 📜 render_0000_no_waves.png
  ┣ 📜 render_0000.png
  ┣ 📜 render_0001_ground.png
  ┣ 📜 render_0001_no_sunglint.png
  ┣ 📜 render_0001_no_waves.png
  ┣ 📜 render_0001.png
  ┣ 📜 ...
  ┣ 📜 render_1199_ground.png
  ┣ 📜 render_1199_no_sunglint.png
  ┣ 📜 render_1199_no_waves.png
  ┣ 📜 render_1199.png
  ┣ 📜 scene_settings.json
  ┗ 📜 LICENSE_and_info.txt

Files

Sea-Undistort.zip

Files (2.3 GB)

Name Size Download all
md5:8054df179c1b01ba26c4910b7105d34a
2.3 GB Preview Download

Additional details

Related works

Is described by
Preprint: 10.48550/arXiv.2508.07760 (DOI)

Funding

European Commission
MagicBathy - Multimodal multitAsk learninG for MultIsCale BATHYmetric mapping in shallow waters 101063294

Dates

Available
2025-08-06

Software

Repository URL
https://github.com/pagraf/Sea-Undistort
Development Status
Active