Published October 23, 2023 | Version v2
Dataset Open

IFCB Dataset for quantification applications

  • 1. University of Oviedo

Description

### IFCB Quantification

#### Introduction

This dataset is based on the data available publicly at https://github.com/hsosik/WHOI-Plankton.

The scripts for the processing are available at https://github.com/pglez82/IFCB_Zenodo

In short, this is the IFCB dataset with precomputed features, intended for testing quantification algorithms.

Samples from 2006 to 2008 (286 samples) serve as training data for quantification algorithms.

Samples from 2009 to 2014 (678 samples) serve as test bags, for which the prevalence of each class must be predicted.
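As an illustration of what "prevalence" means here, the following minimal sketch computes the per-class fraction of examples in one bag from its labels. The function name `class_prevalences` and the class names are hypothetical and only for illustration; they are not taken from the dataset files.

```python
from collections import Counter


def class_prevalences(labels, classes):
    """Return the fraction of examples in one bag that belong to each class."""
    counts = Counter(labels)
    total = len(labels)
    # Classes that do not occur in the bag get prevalence 0.0.
    return {c: (counts[c] / total if total else 0.0) for c in classes}


# Hypothetical bag of five labelled examples (class names are made up).
bag_labels = ["mix", "mix", "ciliate", "detritus", "mix"]
prevalences = class_prevalences(bag_labels, ["ciliate", "detritus", "mix"])
```

A quantification algorithm has to estimate these fractions for each test bag without seeing the labels of its examples.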

The dataset has 50 classes; see IFCB.test_prevalences.zip for the list of observed classes.

Deep features were computed with a ResNet-34 fine-tuned on the training data (using the labels). This yields 512 features, which are available for all the samples in the dataset.

Labels are also provided for every training example.
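To show how features and training labels could be combined into a simple quantification baseline, here is a hedged sketch of "classify and count": train a classifier, predict a label for every example in a test bag, and report the predicted class fractions. The nearest-centroid classifier, the function names, and the toy 2-dimensional vectors (standing in for the 512 features) are all assumptions made for illustration; this is not the method used by the dataset authors.

```python
from collections import Counter


def fit_centroids(features, labels):
    """Compute the mean feature vector of each class in the training data."""
    sums = {}
    counts = Counter(labels)
    for x, y in zip(features, labels):
        acc = sums.setdefault(y, [0.0] * len(x))
        for i, v in enumerate(x):
            acc[i] += v
    return {y: [v / counts[y] for v in acc] for y, acc in sums.items()}


def predict(centroids, x):
    """Assign x to the class whose centroid is closest (squared Euclidean)."""
    def sqdist(c):
        return sum((a - b) ** 2 for a, b in zip(x, c))
    return min(centroids, key=lambda y: sqdist(centroids[y]))


def classify_and_count(centroids, bag):
    """Estimate class prevalences in a bag as the fraction of predicted labels."""
    preds = Counter(predict(centroids, x) for x in bag)
    return {y: preds[y] / len(bag) for y in centroids}


# Toy training data: 2-D vectors stand in for the 512 deep features.
train_x = [[0.0, 0.0], [0.0, 1.0], [5.0, 5.0], [6.0, 5.0]]
train_y = ["a", "a", "b", "b"]
centroids = fit_centroids(train_x, train_y)

# Toy unlabelled test bag.
test_bag = [[0.2, 0.1], [5.0, 6.0], [6.0, 6.0], [5.5, 4.0]]
estimated = classify_and_count(centroids, test_bag)
```

Classify and count is only a naive baseline: when class proportions shift between training and test bags, its estimates tend to be biased, which is the situation quantification methods are designed to handle.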


 

Files

IFCB.test.zip

Files (5.1 GB)

Checksum Size
md5:16313b6ef206c06f1b266b7e81149c7e 3.8 GB
md5:98fb1c510aa95baa0df16aeb9e6d1e47 286.8 kB
md5:2bf72977876111aafc40a48beda7e610 1.3 GB
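The MD5 checksums listed above can be used to check a download for corruption. The sketch below hashes a file in chunks so that multi-gigabyte archives do not need to fit in memory. The helper name `md5_of_file` is hypothetical, and the demonstration uses a small temporary file rather than one of the real archives.

```python
import hashlib
import os
import tempfile


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# Demonstration on a small temporary file (not one of the dataset archives).
payload = b"example payload"
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(payload)
    demo_path = tmp.name

demo_digest = md5_of_file(demo_path)
os.remove(demo_path)
```

For a real download, compare the returned digest with the value after `md5:` on the corresponding line, for example `md5_of_file(path) == "16313b6ef206c06f1b266b7e81149c7e"` for the 3.8 GB file.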