There is a newer version of the record available.

Published August 15, 2024 | Version 1
Dataset Open

Smartbay Marine Types Object Detection Training dataset

Contributors

Annotator:

Data manager:

  • 1. ROR icon Marine Institute

Description

Training Dataset

The SmartBay Observatory in Galway Bay is an important contribution by Ireland to the growing global network of real-time data capture systems deployed within the ocean – technology giving us new insights into the ocean which we have not had before.

The observatory was installed on the seafloor 1.5km off the coast of Spiddal, County Galway, Ireland . The observatory uses cameras, probes and sensors to permit continuous and remote live underwater monitoring. This observatory equipment allows ocean researchers unique real-time access to monitor ongoing changes in the marine environment. Data relating to the marine environment at the site is transferred in real-time from the SmartBay Observatory through a fibre optic telecommunications cable to the Marine Institute headquarters and onwards onto the internet. The data includes a live video stream, the depth of the observatory node, the sea temperature and salinity, and estimates of the chlorophyll and turbidity levels in the water which give an indication of the volume of phytoplankton and other particles, such as sediment, in the water.

The Smartbay Marine Types Object Detection training Dataset is an initial Bounding Box Annotated image dataset used in attempting to Train a YOLOv8 Object Detection Model to classify the Marine Fauna observed in the Smartbay Observatory Video footage using broad "Marine Type" classes.

The imagery used in this training dataset consists of image frame captures from the Smartbay video Archive files, CC-BY imagery from the www.minka-sdg.org website and images taken by Eva Cullen in the "Galway Atlantaquaria" Aquarium in Galway, Ireland.

The imagery were annotated using CVAT, collated on Roboflow and exported in YOLOv8 trainign dataset format.  

Technical info

Data preprocessing

The following pre-processing was applied to each image on the Roboflow platform:

  • Auto-orientation of pixel data (with EXIF-orientation stripping
  • Resize to 640x640 (Stretch)

The following augmentation was applied to create 3 versions of each source image: 

  • Random brigthness adjustment of between -25 and +25 percent
  • Random exposure adjustment of between -23 and +23 percent
  • Random Gaussian blur of between 0 and 3 pixels
  • Salt and pepper noise was applied to 1.88 percent of pixels

Data splitting

The initial training dataset was split as follows: 86% Training Set, 10%  Validation Set and 4% Test Set.

Data labelling

The classes used in the initial dataset are:

Eel, Fish, Flat Fish, Jelly, Pipefish, Ray, Seahorse, Shark

Parameters

NA

Data sources

The imagery used in this training dataset consists of image frame captures from the Smartbay video Archive files, CC-BY imagery from the www.minka-sdg.org website and images taken by Eva Cullen in the "Galway Atlantaquaria" Aquarium in Galway, Ireland.

Data quality

Video frames were extracted from Smart Observatory and Galway Atlantaquaria video recordings for annotation usign CVAT and also ffmpeg.

Target species CC-BY images were also downloaded from minka-sdg.org using a python minka-downloader script (https://github.com/obsea-upc/minka-downloader)

Useful images were manually selected and annotated by the bursor students and collated into training datasets in Roboflow.  

Data resizing

The images in the initial dataset were resized to 640x640 when exported from Roboflow.

Spatial coverage

Smartbay Observatory Spiddal, Galway Bay, Ireland

Galway Atlantaquaria, Seapoint Promenade Galway H91 T2FD, Ireland

Minka-sdg.org Sample Imagery from Spanish and Portuguese waters 

Contact information

Data.Requests@Marine.ie

Files

Marine types T3.v3-smartbay2.yolov8.zip

Files (193.2 MB)

Name Size Download all
md5:9fded53f235b510a48da106760c7a29b
193.2 MB Preview Download

Additional details

Funding

European Commission
iMagine - Imaging data and services for aquatic science 101058625

Dates

Created
2024-08-15
Dataset created in yolov8 format in Roboflow

Biodiversity

Scientific name
Aurelia aurita (Linnaeus, 1758) , Belone belone (Linnaeus, 1760) , Chelidonichthys lucerna (Linnaeus, 1758) , Chelon auratus (Risso, 1810) , Chelon labrosus (Risso, 1827) , Conger conger (Linnaeus, 1758) , Ctenolabrus rupestris (Linnaeus, 1758) , Cyclopterus lumpus Linnaeus, 1758 , Dicentrarchus labrax (Linnaeus, 1758) , Entelurus aequoreus (Linnaeus, 1758) , Eutrigla gurnardus (Linnaeus, 1758) , Hippocampus guttulatus Cuvier, 1829 , Labrus bergylta Ascanius, 1767 , Labrus mixtus Linnaeus, 1758 , Lophius piscatorius Linnaeus, 1758 , Merlangius merlangus (Linnaeus, 1758) , Mullus barbatus Linnaeus, 1758 , Mullus surmuletus Linnaeus, 1758 , Mustelus asterias Cloquet, 1821 , Parablennius gattorugine (Linnaeus, 1758) , Platichthys flesus (Linnaeus, 1758) , Pleuronectes platessa Linnaeus, 1758 , Pollachius pollachius (Linnaeus, 1758) , Pomatoschistus pictus (Malm, 1865) , Raja clavata Linnaeus, 1758 , Raja undulata Lacepède, 1802 , Scomber scombrus Linnaeus, 1758 , Scophthalmus maximus (Linnaeus, 1758) , Scyliorhinus canicula (Linnaeus, 1758) , Solea solea (Linnaeus, 1758) , Spondyliosoma cantharus (Linnaeus, 1758) , Squalus acanthias Linnaeus, 1758 , Squatina squatina (Linnaeus, 1758) , Symphodus melops (Linnaeus, 1758) , Trisopterus luscus (Linnaeus, 1758) , Zeugopterus punctatus (Bloch, 1787)
Scientific name ID
urn:lsid:marinespecies.org:taxname:135306 , urn:lsid:marinespecies.org:taxname:126375 , urn:lsid:marinespecies.org:taxname:127262 , urn:lsid:marinespecies.org:taxname:1044127 , urn:lsid:marinespecies.org:taxname:126977 , urn:lsid:marinespecies.org:taxname:126285 , urn:lsid:marinespecies.org:taxname:126964 , urn:lsid:marinespecies.org:taxname:127214 , urn:lsid:marinespecies.org:taxname:126975 , urn:lsid:marinespecies.org:taxname:127379 , urn:lsid:marinespecies.org:taxname:150637 , urn:lsid:marinespecies.org:taxname:154776 , urn:lsid:marinespecies.org:taxname:126965 , urn:lsid:marinespecies.org:taxname:151501 , urn:lsid:marinespecies.org:taxname:126555 , urn:lsid:marinespecies.org:taxname:126438 , urn:lsid:marinespecies.org:taxname:126985 , urn:lsid:marinespecies.org:taxname:126986 , urn:lsid:marinespecies.org:taxname:105821 , urn:lsid:marinespecies.org:taxname:126770 , urn:lsid:marinespecies.org:taxname:127141 , urn:lsid:marinespecies.org:taxname:127143 , urn:lsid:marinespecies.org:taxname:126440 , urn:lsid:marinespecies.org:taxname:126930 , urn:lsid:marinespecies.org:taxname:105883 , urn:lsid:marinespecies.org:taxname:105891 , urn:lsid:marinespecies.org:taxname:127023 , urn:lsid:marinespecies.org:taxname:127149 , urn:lsid:marinespecies.org:taxname:105814 , urn:lsid:marinespecies.org:taxname:127160 , urn:lsid:marinespecies.org:taxname:127066 , urn:lsid:marinespecies.org:taxname:105923 , urn:lsid:marinespecies.org:taxname:105928 , urn:lsid:marinespecies.org:taxname:273571 , urn:lsid:marinespecies.org:taxname:126445 , urn:lsid:marinespecies.org:taxname:127151
Taxon rank
species