IFCB Dataset for quantificaton applications

González-González, Pablo

doi:10.5281/zenodo.10036244

Published October 23, 2023 | Version v2

Dataset Open

IFCB Dataset for quantificaton applications

González-González, Pablo¹

1. Universidad de Oviedo

### IFCB Quantification

#### Introduction

This dataset is based on the data available publicly at https://github.com/hsosik/WHOI-Plankton.

The scripts for the processing are available at https://github.com/pglez82/IFCB_Zenodo

Basically, this is the IFCB dataset with precomputed features for testing quantification algorithms.

Samples from 2006 to 2008 (286 samples) are considered as training data for quantification algorithms.

Samples from 2009 to 2014 (678 samples) are considered as test bags for which the prevalence of the classes must be predicted.

The dataset has 50 classes (check the IFCB.test_prevalences.zip) to get a list of the observed classes.

Deep features have been computed using a resnet34 finetuned in the train data (using the labels). This results in 512 features which are available for all the samples in the dataset.

The label of each example is also available for all the train examples.

Files

IFCB.test.zip

Files (5.1 GB)

Name	Size	Download all
IFCB.test.zip md5:16313b6ef206c06f1b266b7e81149c7e	3.8 GB	Preview Download
IFCB.test_prevalences.zip md5:98fb1c510aa95baa0df16aeb9e6d1e47	286.8 kB	Preview Download
IFCB.train.zip md5:2bf72977876111aafc40a48beda7e610	1.3 GB	Preview Download

	All versions	This version
Views	106	88
Downloads	111	94
Data volume	181.5 GB	155.3 GB

IFCB Dataset for quantificaton applications

Creators

Description

Files

IFCB.test.zip

Files (5.1 GB)