Info: Zenodo’s user support line is staffed on regular business days between Dec 23 and Jan 5. Response times may be slightly longer than normal.

Published July 9, 2024 | Version 1.0.0
Dataset Open

BWILD: Beach seagrass Wrack Identification Labelled Dataset

Contributors

Project member:

  • 1. ROR icon Balearic Islands Coastal Observing and Forecasting System

Description

Training dataset

BWILD is a dataset tailored to train Artificial Intelligence applications to automate beach seagrass wrack detection in RGB images. It includes oblique RGB images captured by SIRENA beach video-monitoring systems, along with corresponding annotations, auxiliary data and a README file. BWILD encompasses data from two microtidal sandy beaches in the Balearic Islands, Spain. The dataset consists of images with varying fields of view (9 cameras), beach wrack abundance, degrees of occupation, and diverse meteoceanic and lighting conditions. The annotations categorise image pixels into five classes: i) Landwards, ii) Seawards, iii) Diffuse wrack, iv) Intermediate wrack, and v) Dense wrack.

Technical details

The BWILD version 1.0.0 is packaged in a compressed file (BWILD_v1.0.0.zip). A total of 3286 RGB images are shared in PNG format, corresponding annotations and masks in various formats (PNG, XML, JSON), and the README file in PDF format.

Data preprocessing

The BWILD dataset utilizes snapshot images from two SIRENA beach video-monitoring systems. To facilitate annotation while maintaining a diverse range of scenarios, the original 1280x960 pixel images were cropped to smaller regions, with a uniform resolution of 640x480 pixels. A subset of images was carefully curated to minimize annotation workload while ensuring representation of various time periods, distances to camera, and environmental conditions.  Image selection involved filtering for quality, clustering for diversity, and prioritizing scenes containing beach seagrass wracks. Further details are available in the README file. 

Data splitting

Data splitting requirements may vary depending on the chosen Artificial Intelligence approach (e.g., splitting by entire images or by image patches). Researchers should use a consistent method and document the approach and splits used in publications, enabling reproducible results and facilitating comparisons between studies. 

Classes, labels and annotations

The BWILD dataset has been labelled manually using the 'Computer Vision Annotation Tool' (CVAT), categorising pixels into five labels of interest using polygon annotations.

        Label                                                                              Description
landwards Pixels that are towards the landside with respect to the shoreline
seawards Pixels that are towards the seaside with respect to the shoreline
diffuse wrack Pixels that potentially resembled beach wracks based on colour and shape, yet the annotator could not confirm this with certainty, were denoted as ‘diffuse wrack’
Intermediate wrack Pixels with low-density beach wracks or mixed beach wracks and sand surfaces
Dense wrack Pixels with high-density beach wracks

Annotations were exported from CVAT in three different formats: (i) CVAT for images (XML); (ii) Segmentation Mask 1.0 (PNG); (iii) COCO (JSON). These diverse annotation formats can be used for various applications including object detection and segmentation, and simplify the interaction with the dataset, making it more user-friendly. Further details are available in the README file.  

Parameters

RGB values or any transformation in the colour space can be used as parameters.

Data sources

A SIRENA system consists of a set of RGB cameras mounted at the top of buildings on the beachfront. These cameras take oblique pictures of the beach, with overlapping sights, at 7.5 FPS during the first 10 minutes of each hour in daylight hours. From these pictures, different products are generated, including snapshots, which correspond to the frame of the video at the 5th minute. In the Balearic Islands, SIRENA stations are managed by the Balearic Islands Coastal Observing and Forecasting System (SOCIB), and are mounted at the top of hotels located in front of the coastline. The present dataset includes snapshots from the SIRENA systems operating since 2011 at Cala Millor (5 cameras) and Son Bou (4 cameras) beaches, located in Mallorca and Menorca islands (Balearic Islands, Spain), respectively. All latest and historical SIRENA images are available at the Beamon app viewer (https://apps.socib.es/beamon). 

Data quality

All images included in BWILD have been supervised by the authors of the dataset. However, variable presence of beach segrass wracks across different beach segments and seasons impose a variable distribution of images across different SIRENA stations and cameras. Users of BWILD dataset must be aware of this variance. Further details are available in the README file. 

Image resolution

The resolution of the images in BWILD is of 640x480 pixels.

Spatial coverage

The BWILD version 1.0.0 contains data from two SIRENA beach video-monitoring stations, encompassing two microtidal sandy beaches in the Balearic Islands, Spain. These are: Cala Millor (clm) and Son Bou (snb). 

SIRENA station  Longitude   Latitude
clm 3.383 39.596
snb 4.077 39.898

Contact information

For further technical inquiries or additional information about the annotated dataset, please contact jsoriano@socib.es.

Notes (English)

The BWILD dataset is a product of the "Beach Monitoring Use Case" within the "iMagine project" with funding from the European Union's Horizon Europe research and innovation programme. The authors express their gratitude to the project managers and all partners involved for fostering the creation of open-access image repositories for AI-based image analysis services. 

Files

BWILD_v1.0.0.zip

Files (1.1 GB)

Name Size Download all
md5:f8cb139f2585c5cc43166459a0e36078
1.1 GB Preview Download
md5:6ccf6ea8f68a2b049172965e27cbdd80
5.1 MB Preview Download

Additional details

Funding

This work was supported by the iMagine project with funding from the European Union’s Horizon Europe research and innovation programme under grant agreement No. 101058625. 101058625
European Union
iMagine — Imaging data and services for aquatic science 101058625
European Commission

Dates

Created
2024-07