PomerFish: A dataset for fishes across Pomerania freshwater waterbodies in-situ environments
Authors/Creators
Contributors
Data collector:
Description
Overview:
We established a novel dataset through underwater video surveillance across temperate freshwater habitats in the Pomeranian region of Central Europe, focusing on endangered Salmon and ecologically significant species. Researchers collected these videos between 2015 and 2024 using a GoPro Hero 5 camera model. The PomerFish dataset comprises two sub-datasets: PomerFishObj and PomerFishSeg. Our dataset features both bounding box and segmentation mask annotations, enabling its application in in-habitat monitoring and growth status evaluation of freshwater species, a capability surpassing existing datasets. For the annotations of the images, CVAT v2.13 open-source software was utilized.
Usage:
The entire dataset is packaged within a compressed archive file named PomerFish.rar. PomerFishObj cotains imges and a coco format json file. PomerFishSeg provides two distinct segmentation masks: (1) semantic segmentation masks (SegmentationClass), where each pixel is labeled with a categorical class, and (2) instance segmentation masks (SegmentationObject), which delineate individual object instances for counting. The sample of dataset (PomerFish_subset.rar) were provided for users to check the dataset structure. For easy PyTorch integration, we provide two notebook scripts: one for object detection (train_detection) and one for semantic segmentation (train_segmentation). These scripts include pre-processing, PyTorch dataloaders, and complete baselines using pre-trained YOLOv5 and deeplabv3_resnet50 models.