Published October 13, 2018 | Version 1.0
Dataset Open

Data from Automated plankton image analysis using convolutional neural networks

  • 1. Rosenstiel School of Marine and Atmospheric Sciences, University of Miami, Miami FL; Hatfield Marine Science Center, Oregon State University, Newport OR
  • 2. Sorbonne Universités, UPMC Univ Paris 06, Laboratoire d'Océanographie de Villefranche
  • 3. Department of Statistics, University of Warwick, Coventry CV4 7AL, UK
  • 4. Rosenstiel School of Marine and Atmospheric Sciences, University of Miami, Miami FL
  • 5. Center for Computational Science, University of Miami
  • 6. Hatfield Marine Science Center, Oregon State University, Newport OR; Rosenstiel School of Marine and Atmospheric Sciences, University of Miami, Miami FL

Description

Datasets and code from Luo et al., "Automated plankton image analysis using convolutional neural networks." Limnology and Oceanography Methods.

Data include:

1) 42,564 item training library, sorted in 108 classes,

2) 42,548 item test set for filtering thresholds, sorted into 38 groups. These images are independent from the training library, and are used for setting the thresholds for post-classification filtering.
CSV file: Luo_etal_FT_images_pred.csv contains the image name, predicted class, predicted probability, and validated group. Note that the file class_to_group.csv is needed to match up the class names to the group names.

3) 75,000 item fully random, validated set for confusion matrix calculations, sorted into 38 groups. This set is a representation of the full dataset, selected at random after classification. 
CSV file: Luo_etal_confusionmatrix_images.csv contains the image name, predicted class, predicted probability, and validated group. Note that the file class_to_group.csv is needed to match up the class names to the group names.

 

Scripts and programs:

1) Segmentation.zip contains the scripts and executables for the segmentation program.

2) Plankton_template.zip contains the archived version of the SparseConvNet program used in manuscript (current version available at: https://github.com/btgraham/SparseConvNet or https://github.com/facebookresearch/SparseConvNet)
Note that google-sparsehash is necessary for running SparseConvNet.
Also, plankton_epoch-150.cnn are the weights from the training used in the manuscript, and should be placed in the /weights folder if you want to replicate the classifications.

Files

class_to_group.csv

Files (299.6 MB)

Name Size Download all
md5:c3bb405fae65f1f6b77736c15eac6043
3.4 kB Preview Download
md5:814205fe94afb6dde4de470ed733523c
6.5 MB Preview Download
md5:b371dfb1ec06e59bb7783d015fba4ea8
82.8 MB Download
md5:3d7d5f5da19996af8128ca8a14a31d55
70.2 MB Download
md5:e8466c5d9c0a81617306e5dadd5a56ce
2.9 MB Preview Download
md5:7a2caf1abfbb4b8c3ffe45c9135dedeb
122.9 MB Download
md5:e569cd73f67f3f8ff4bc22a44f7be20a
12.9 MB Download
md5:636d2d0ddaba4f97929ac409004bd856
925.9 kB Preview Download
md5:145610736c81050e002b6cb06d23369c
515.1 kB Preview Download

Additional details

Related works

Is referenced by
10.1002/lom3.10285 (DOI)

Funding

U.S. National Science Foundation
Spatial variability of larval fish in relation to their prey and predator fields: Patterns and interactions from cm to 10s of km in a subtropical, pelagic environment 1419987