Published March 19, 2026 | Version v1
Dataset Open

PlanktoShare: A large (50k+) and FAIR learning set for the Plankton Imager (Pi-10) for the Greater North Sea and NE Atlantic, based on a new flexible classification protocol

  • 1. Rijkswaterstaat
  • 2. EDMO icon Centre for Environment, Fisheries and Aquaculture Science, Lowestoft Laboratory
  • 3. ROR icon British Antarctic Survey
  • 4. ROR icon Plymouth Marine Laboratory
  • 5. EDMO icon Wageningen Marine Research (Den Helder)
  • 6. ROR icon Royal Netherlands Institute for Sea Research

Description

This is the accompanying dataset to the submitted paper "PlanktoShare: A large (50k+) and FAIR learning set for the Plankton Imager (Pi-10) for the Greater North Sea and NE Atlantic, based on a new flexible classification protocol." by Lodewijk Van Walraven, James Scott, Sophie Pitois, Joseph Ribeiro, Hayden Close, Cecilia M. Liszka, Elaine Fileman, Jeroen Hoekendijk, Pieter Hovenkamp, Robbert Jak, Joost van Dalen, Dick van Oevelen.

The dataset contains labeled images of plankton, detritus and other particles collected using the Pi-10 Plankton Imager (Plankton Analytics) in the North East Atlantic area on the research vessels Tridens II and CEFAS Endeavour. The labeling convention is explained in the paper and in the file PlanktoShare_readme.txt. 

Code for training and inference on new unseen images is publicly available at: github.com/geoJoost/plankton_imager_classifier, to allow iterative model development by other researchers

 

Files

PlanktoShare.zip

Files (2.2 GB)

Name Size Download all
md5:a7c5f40f20346baeb98e462f0cf895fd
2.2 GB Preview Download
md5:16af344cfa4c4efbab028be9fe373d5a
7.5 MB Preview Download
md5:c0273a0cf01d9d7c1ac9c8ea03161ed2
2.1 kB Preview Download

Additional details

Software

Repository URL
https://github.com/geoJoost/planktoshare
Programming language
Python
Development Status
Active