Published November 2, 2017 | Version v1
Journal article Open

Applications of deep convolutional neural networks to digitized natural history collections

  • 1. National Museum of Natural History, Smithsonian Institution, Washington, DC, United States of America
  • 2. Office of the Chief Information Officer, Smithsonian Institution, Washington, DC, United States of America
  • 3. NVIDIA, Santa Clara, CA, United States of America

Description

Natural history collections contain data that are critical for many scientific endeavors. Recent efforts in mass digitization are generating large datasets from these collections that can provide unprecedented insight. Here, we present examples of how deep convolutional neural networks can be applied in analyses of imaged herbarium specimens. We first demonstrate that a convolutional neural network can detect mercury-stained specimens across a collection with 90% accuracy. We then show that such a network can correctly distinguish two morphologically similar plant families 96% of the time. Discarding the most challenging specimen images increases accuracy to 94% and 99%, respectively. These results highlight the importance of mass digitization and deep learning approaches and reveal how they can together deliver powerful new investigative tools.

Files

BDJ_article_21139.pdf

Files (3.4 MB)

Name Size Download all
md5:fc67c46e9facaa956f3452cc192f8656
3.4 MB Preview Download
md5:b40e7f6baadb601af2c2affc8abccc51
42.4 kB Preview Download

Linked records