Published April 18, 2025
| Version PROCESSED_DATA
Dataset
Open
Underwater images collected by an Autonomous Surface Vehicle in Aldabra-Arm01, Seychelles - 2022-10-20
- 1. Ifremer DOI, La Réunion, France
- 2. UMR Marbec, IRD, France
- 3. INRIA Zenith, Montpellier, France
Contributors
Data collectors:
Data manager:
Project members:
- Alex Rose2
-
Alexandre Boyer3
- Alvin Jean-Bonnelame2
- Andrea Goharzadeh1
- Anna Koester2
- Annabelle Constance2
- April Burt2
-
Arthur Lazennec1
- Beled De Ana
-
Cam Ly Rintz1
- Christopher Jones2
- Denis De Oliveira1
- Elma Balette2
- Francis Salomon2
- Frauke Fleischer-Dogley2
-
Gaétan Morand3
- Georgette Savy2
- Guilly Mellie2
-
Justine Talpaert Daudon3
- Laurence Maurel4
-
Leanne Carpentier1
-
Magali Duval1
- Michelle Risi2
- Pascal Mouquet1
-
Pierre Gogendeau1
- Sebastian Cowin2
-
Serge Bernard1
-
Sylvain Poulain3
-
Thomas Chevrier1
-
Victor Russias1
- 1. Ifremer DOI, La Réunion, France
- 2. SIF, Seychelles
- 3. UMR Marbec, IRD, La Réunion, France
- 4. Kart'eau, La Réunion, France
Description
This dataset was collected by an Autonomous Surface Vehicle in Aldabra-Arm01, Seychelles - 2022-10-20.
Underwater or aerial images collected by scientists or citizens can have a wide variety of use for science, management, or conservation. These images can be annotated and shared to train IA models which can in turn predict the objects on the images. We provide a set of tools (hardware and software) to collect marine data, predict species or habitat, and provide maps.
This dataset is part of larger collection referencing numerous underwater and aerial images Seatizen Altas. Methods, tools and scientific objectives are also described in a dedicated data paper.
Base : Files coming from rtk a GPS-fixed station or any static positioning instrument which can provide with correction frames.
Device GPS : Emlid Reach M2
Quality of our data - Q1: 88.7 %, Q2: 11.13 %, Q5: 0.17 %
We only keep the values which have a GPS correction in Q1.
We keep the points that are the waypoints.
We keep the raw data where depth was estimated between 0.2 m and 40.0 m deep.
The data are first referenced against the WGS84 ellipsoid.
At the end of processing, the data are projected into a homogeneous grid to create a raster and a shapefiles.
The size of the grid cells is 0.525 m.
The raster and shapefiles are generated by linear interpolation. The 3D reconstruction algorithm is ballpivot.
├── DCIM : folder to store videos and photos depending on the media collected.
├── GPS : folder to store any positioning related file. If any kind of correction is possible on files (e.g. Post-Processed Kinematic thanks to rinex data) then the distinction between device data and base data is made. If, on the other hand, only device position data are present and the files cannot be corrected by post-processing techniques (e.g. gpx files), then the distinction between base and device is not made and the files are placed directly at the root of the GPS folder.
│ ├── BASE : files coming from rtk station or any static positioning instrument.
│ └── DEVICE : files coming from the device.
├── METADATA : folder with general information files about the session.
├── PROCESSED_DATA : contain all the folders needed to store the results of the data processing of the current session.
│ ├── BATHY : output folder for bathymetry raw data extracted from mission logs.
│ ├── FRAMES : output folder for georeferenced frames extracted from DCIM videos.
│ ├── IA : destination folder for image recognition predictions.
│ └── PHOTOGRAMMETRY : destination folder for reconstructed models in photogrammetry.
└── SENSORS : folder to store files coming from other sources (bathymetry data from the echosounder, log file from the autopilot, mission plan etc.).
All predictions were generated by our inference pipeline.
You can find all the necessary scripts to download this data in this repository.
Enjoy your data with SeatizenDOI!
Underwater or aerial images collected by scientists or citizens can have a wide variety of use for science, management, or conservation. These images can be annotated and shared to train IA models which can in turn predict the objects on the images. We provide a set of tools (hardware and software) to collect marine data, predict species or habitat, and provide maps.
This dataset is part of larger collection referencing numerous underwater and aerial images Seatizen Altas. Methods, tools and scientific objectives are also described in a dedicated data paper.
GPS information:
The data was processed with a PPK workflow to achieve centimeter-level GPS accuracy.Base : Files coming from rtk a GPS-fixed station or any static positioning instrument which can provide with correction frames.
Device GPS : Emlid Reach M2
Quality of our data - Q1: 88.7 %, Q2: 11.13 %, Q5: 0.17 %
Bathymetry
The data are collected using a single-beam echosounder S500.We only keep the values which have a GPS correction in Q1.
We keep the points that are the waypoints.
We keep the raw data where depth was estimated between 0.2 m and 40.0 m deep.
The data are first referenced against the WGS84 ellipsoid.
At the end of processing, the data are projected into a homogeneous grid to create a raster and a shapefiles.
The size of the grid cells is 0.525 m.
The raster and shapefiles are generated by linear interpolation. The 3D reconstruction algorithm is ballpivot.
Generic folder structure
YYYYMMDD_COUNTRYCODE-optionalplace_device_session-number├── DCIM : folder to store videos and photos depending on the media collected.
├── GPS : folder to store any positioning related file. If any kind of correction is possible on files (e.g. Post-Processed Kinematic thanks to rinex data) then the distinction between device data and base data is made. If, on the other hand, only device position data are present and the files cannot be corrected by post-processing techniques (e.g. gpx files), then the distinction between base and device is not made and the files are placed directly at the root of the GPS folder.
│ ├── BASE : files coming from rtk station or any static positioning instrument.
│ └── DEVICE : files coming from the device.
├── METADATA : folder with general information files about the session.
├── PROCESSED_DATA : contain all the folders needed to store the results of the data processing of the current session.
│ ├── BATHY : output folder for bathymetry raw data extracted from mission logs.
│ ├── FRAMES : output folder for georeferenced frames extracted from DCIM videos.
│ ├── IA : destination folder for image recognition predictions.
│ └── PHOTOGRAMMETRY : destination folder for reconstructed models in photogrammetry.
└── SENSORS : folder to store files coming from other sources (bathymetry data from the echosounder, log file from the autopilot, mission plan etc.).
Software
All the raw data was processed using our worflow.All predictions were generated by our inference pipeline.
You can find all the necessary scripts to download this data in this repository.
Enjoy your data with SeatizenDOI!
Notes
Files
GPS.zip
Additional details
Identifiers
- URN
- urn:20221020_SYC-ALDABRA-ARM01_ASV-02_01
Related works
- Is metadata for
- Dataset: https://zenodo.org/doi/10.5281/zenodo.11125847 (URL)
Dates
- Collected
-
2022-10-20
- Valid
-
2025-04-18