BWILD: Beach seagrass Wrack Identification Labelled Dataset
Creators
Contributors
Project member:
Description
Training dataset
BWILD is a dataset tailored to train Artificial Intelligence applications to automate beach seagrass wrack detection in RGB images. It includes oblique RGB images captured by SIRENA beach video-monitoring systems, along with corresponding annotations, auxiliary data and a README file. BWILD encompasses data from two microtidal sandy beaches in the Balearic Islands, Spain. The dataset consists of images with varying fields of view (9 cameras), beach wrack abundance, degrees of occupation, and diverse meteoceanic and lighting conditions. The annotations categorise image pixels into five classes: i) Landwards, ii) Seawards, iii) Diffuse wrack, iv) Intermediate wrack, and v) Dense wrack.
Technical details
The BWILD version 1.0.0 is packaged in a compressed file (BWILD_v1.0.0.zip). A total of 3286 RGB images are shared in PNG format, corresponding annotations and masks in various formats (PNG, XML, JSON), and the README file in PDF format.
Data preprocessing
The BWILD dataset utilizes snapshot images from two SIRENA beach video-monitoring systems. To facilitate annotation while maintaining a diverse range of scenarios, the original 1280x960 pixel images were cropped to smaller regions, with a uniform resolution of 640x480 pixels. A subset of images was carefully curated to minimize annotation workload while ensuring representation of various time periods, distances to camera, and environmental conditions. Image selection involved filtering for quality, clustering for diversity, and prioritizing scenes containing beach seagrass wracks. Further details are available in the README file.
Data splitting
Data splitting requirements may vary depending on the chosen Artificial Intelligence approach (e.g., splitting by entire images or by image patches). Researchers should use a consistent method and document the approach and splits used in publications, enabling reproducible results and facilitating comparisons between studies.
Classes, labels and annotations
The BWILD dataset has been labelled manually using the 'Computer Vision Annotation Tool' (CVAT), categorising pixels into five labels of interest using polygon annotations.
Label | Description |
landwards | Pixels that are towards the landside with respect to the shoreline |
seawards | Pixels that are towards the seaside with respect to the shoreline |
diffuse wrack | Pixels that potentially resembled beach wracks based on colour and shape, yet the annotator could not confirm this with certainty, were denoted as ‘diffuse wrack’ |
Intermediate wrack | Pixels with low-density beach wracks or mixed beach wracks and sand surfaces |
Dense wrack | Pixels with high-density beach wracks |
Annotations were exported from CVAT in three different formats: (i) CVAT for images (XML); (ii) Segmentation Mask 1.0 (PNG); (iii) COCO (JSON). These diverse annotation formats can be used for various applications including object detection and segmentation, and simplify the interaction with the dataset, making it more user-friendly. Further details are available in the README file.
Parameters
RGB values or any transformation in the colour space can be used as parameters.
Data sources
A SIRENA system consists of a set of RGB cameras mounted at the top of buildings on the beachfront. These cameras take oblique pictures of the beach, with overlapping sights, at 7.5 FPS during the first 10 minutes of each hour in daylight hours. From these pictures, different products are generated, including snapshots, which correspond to the frame of the video at the 5th minute. In the Balearic Islands, SIRENA stations are managed by the Balearic Islands Coastal Observing and Forecasting System (SOCIB), and are mounted at the top of hotels located in front of the coastline. The present dataset includes snapshots from the SIRENA systems operating since 2011 at Cala Millor (5 cameras) and Son Bou (4 cameras) beaches, located in Mallorca and Menorca islands (Balearic Islands, Spain), respectively. All latest and historical SIRENA images are available at the Beamon app viewer (https://apps.socib.es/beamon).
Data quality
All images included in BWILD have been supervised by the authors of the dataset. However, variable presence of beach segrass wracks across different beach segments and seasons impose a variable distribution of images across different SIRENA stations and cameras. Users of BWILD dataset must be aware of this variance. Further details are available in the README file.
Image resolution
The resolution of the images in BWILD is of 640x480 pixels.
Spatial coverage
The BWILD version 1.0.0 contains data from two SIRENA beach video-monitoring stations, encompassing two microtidal sandy beaches in the Balearic Islands, Spain. These are: Cala Millor (clm) and Son Bou (snb).
SIRENA station | Longitude | Latitude |
clm | 3.383 | 39.596 |
snb | 4.077 | 39.898 |
Contact information
For further technical inquiries or additional information about the annotated dataset, please contact jsoriano@socib.es.
Notes (English)
Files
BWILD_v1.0.0.zip
Files
(1.1 GB)
Name | Size | Download all |
---|---|---|
md5:f8cb139f2585c5cc43166459a0e36078
|
1.1 GB | Preview Download |
md5:6ccf6ea8f68a2b049172965e27cbdd80
|
5.1 MB | Preview Download |
Additional details
Funding
Dates
- Created
-
2024-07