DeepCob: Precise and high-throughput analysis of maize cob geometry using deep learning with an application in genebank phenomics
- 1. Institute of Plant Breeding, Seed Science and Population Genetics, University of Hohenheim, Stuttgart, Germany
- 2. Universidad National Agraria La Molina (UNALM), Lima, Peru
- 3. Computational Science Lab, University of Hohenheim, Stuttgart, Germany
Description
Background: Maize cobs are an important component of crop yield that exhibit a high diversity in size, shape and color in native landraces and modern varieties. Various phenotyping approaches were developed to measure maize cob parameters in a high throughput fashion. More recently, deep learning methods like convolutional neural networks (CNN) became available and were shown to be highly useful for high-throughput plant phenotyping. We aimed at comparing classical image segmentation with deep learning methods for maize cob image segmentation and phenotyping using a large image dataset of native maize landrace diversity from Peru.
Results: Comparison of three image analysis methods showed that a Mask R-CNN trained on a diverse set of maize cob images was highly superior to classical image analysis due to its robustness to image quality and object segmentation accuracy (r=0.99). The Mask R-CNN was integrated into a high-throughput pipeline that segments both maize cobs and a ruler in images and then performs automated quantitative analysis of eight phenotypic traits, including diameter, length, ellipticity, asymmetry, aspect ratio and average RGB values for cob color. Statistical analysis identified key training parameters for efficient iterative model updating. Only a small number of 10-20 images are required to update the initial Mask-R-CNN model to process new types of cob images. We demonstrate an application of the pipeline by analyzing phenotypic variation in 19,867 maize cobs contained in 3,449 images of 2,484 accessions from the maize genebank of Peru to identify phenotypically homogeneous and heterogeneous genebank accessions using multivariate clustering.
Conclusions: We show that a single Mask R-CNN model and associated analysis pipeline are widely applicable tools for maize cob phenotyping in contexts like genebank phenomics or plant breeding.
Files
ImgCross-training-data.zip
Files
(18.7 GB)
Name | Size | Download all |
---|---|---|
md5:03130b6f906fb7d1af675662bd4935b4
|
349.7 MB | Preview Download |
md5:064a73849a65d6bb7326c107a343531a
|
83.5 kB | Preview Download |
md5:a0ca651ad974257340bf9df88b544003
|
2.5 GB | Preview Download |
md5:d99dbdcf2797b41204b85c33bad8d6be
|
15.5 GB | Preview Download |
md5:ec33f749a63815aa25f38077d356e9ab
|
3.7 MB | Preview Download |
md5:4386d9992d68b61da3e299d52a1e7bd5
|
321.9 MB | Preview Download |
md5:2f44903f75be7c354bedb8cec96b5502
|
64.9 MB | Preview Download |
md5:452ee6b8fd48b6a7bab3d2b2b593c648
|
31.2 MB | Preview Download |
md5:b544b254e945c3a9533a01cc32a29e04
|
1.2 kB | Preview Download |