Published August 13, 2025 | Version v1
Dataset Open

PlanktonFlow76 - A curated FlowCam dataset for plankton classification

Description

This dataset contains a curated collection of FlowCAM images acquired at INRAE UMR DECOD (Rennes, France) during a multi-year mesocosm experiment in controlled aquatic systems. FlowCAM imaging was used to capture planktonic organisms and particles for ecological monitoring and analysis, and images were segmented using ZooProcess.

The dataset is provided in two forms:

  • Raw dataset: ~130,000 images collected directly with the FlowCAM and segmented by ZooProcess. These images can be fed into PlanktonFlow starting at the preprocessing step.

  • Processed dataset: ~190,000 images obtained after PlanktonFlow preprocessing (augmentation, splitting, and other steps), ready for downstream analysis.

Images are organized by taxonomic classes (species) as identified by experts at UMR DECOD. As with any manual classification, occasional errors or ambiguous categories may be present.
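Because the images are organized by class, a quick per-class count is a useful first check for class imbalance and for spotting ambiguous categories. The sketch below assumes the extracted archive contains one subdirectory per class with image files inside it. That layout, the list of image extensions, and the example path are assumptions, not something this record states, so adjust them to what the archive actually contains.

```python
from collections import Counter
from pathlib import Path

# Assumed image extensions; the record does not state the file format.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".tif", ".tiff"}


def count_images_per_class(root):
    """Count image files under each immediate subdirectory of `root`.

    Assumes (hypothetically) that each subdirectory name is a class label.
    """
    root = Path(root)
    counts = Counter()
    for class_dir in sorted(p for p in root.iterdir() if p.is_dir()):
        counts[class_dir.name] = sum(
            1
            for f in class_dir.rglob("*")
            if f.is_file() and f.suffix.lower() in IMAGE_SUFFIXES
        )
    return counts


# Example usage (the path is hypothetical):
# counts = count_images_per_class("PlanktonFlow76/raw")
# for label, n in counts.most_common():
#     print(f"{label}\t{n}")
```

Listing the classes from most to least frequent makes rare categories, which are the ones most affected by labeling errors, easy to find.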

This resource can support ecological studies, machine learning model development, and reproducibility of the associated research. For further details, please refer to the associated publication and the open-source codebase available on GitHub.

Files

preprocessed_PlanktonFlow76.zip

Files (2.4 GB)

Size        Checksum
640.4 MB    md5:57546660d07abe88b5d0aeb8aad8d5da
1.7 GB      md5:a7d34562e7dcb8dea1f23829c218fc98
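The MD5 checksums listed above can be used to confirm that a download completed without corruption. The following is a minimal sketch using only Python's standard `hashlib`. The helper names `md5_of_file` and `matches_checksum` are illustrative, and the file path in the usage comment is a placeholder, since the listing does not say which checksum belongs to which archive.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file's MD5 with an expected value such as 'md5:abc123...'."""
    # Accept the value with or without the 'md5:' prefix used in the listing.
    expected_hex = expected.strip().split(":", 1)[-1].lower()
    return md5_of_file(path) == expected_hex


# Example usage (the path is a placeholder for a downloaded archive):
# ok = matches_checksum("downloaded_archive.zip",
#                       "md5:57546660d07abe88b5d0aeb8aad8d5da")
# print("checksum OK" if ok else "checksum mismatch, re-download the file")
```

Reading in chunks keeps memory use flat even for the larger archive.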

Additional details

Related works

Is cited by
Publication: 10.1101/2025.09.19.677346 (DOI)
Is supplemented by
Other: 10.5281/zenodo.19222299 (DOI)

Software

Repository URL
https://github.com/ziraax/PlanktonFlow
Programming language
Python
Development Status
Active