Published February 17, 2022 | Version v1
Dataset Open

Data from: Investigating human repeatability of a computer vision based task to identify meristems on a potato plant (Solanum tuberosum)

Description

Labelled training data in artificial intelligence (AI) is used to teach so-called 'supervised learning models'. However, such data may contain error or bias, which can impact model prediction accuracy. Thus, obtaining accurate training data is of high importance. In applications of AI, such as in classification and detection problems, raw training data is not always made available in published research. Likewise, the process of obtaining labelled data is not always documented well enough to enable reproducibility. This training data set captures a repeatability exercise in AI training data collection for a task that is difficult for humans to perform, delineating a bounding box in a two-dimensional image of a growing apical meristem in potato plants.

Notes

Contents

•"SURNAME-stem-repeatability.zip" file contains folder with 30 images (10 unique) and bounding box data capture program "desktop_image_labeller.py"

•"Tuber-stem-repeatability-instructions.docx"- instructions for observers taking part in the computer experiment

•"further-information-distance-between-centres.docx" - details on the distance between centers measurement

•"boxes.xlsx" - dataset explaining the information for each individual bounding box

•"c_dist.xlsx" - dataset for the distance between the centers

•"stems.xlsx" - dataset providing information on the number of stems identified per image

• "repeatability-images-cheat-sheet.xlsx" – dataset providing a key to the unique images that have been replicated three times

•"DRYAD-README.docx" - Use notes for repeatability data capture and associated documents

Funding provided by: UK Research and Innovation
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100014013
Award Number: 48752

Files

Files (574.2 kB)

Name Size Download all
md5:f2a12c80abc8bf57565d7d3b3a1a9025
311.4 kB Download
md5:13c5e84e7286cbab7304a449d14f9d28
225.1 kB Download
md5:30f69419b987c3f0c7cd8d9ca82b689f
17.3 kB Download
md5:e06ee542348ecf2ed0298159cfaa76a6
20.4 kB Download

Additional details

Related works

Is source of
10.5281/zenodo.5799425 (DOI)