Published 2026 | Version v2
Dataset Open

Validation Dataset for "Entomoscope 2.0 and ENIMAS 2.0: An Open-Source, AI-Integrated Platform for Rapid and Affordable Insect Digitization"

Authors/Creators

  • 1. Karlsruhe Institute of Technology - KIT

Description

This repository contains the complete dataset and validation dataset used in the manuscript "Entomoscope 2.0 and ENIMAS 2.0: An Open-Source, AI-Integrated Platform for Rapid and Affordable Insect Digitization".

The data is divided into two primary components to ensure full reproducibility of the study's results and to support the training of future AI models for insect digitization.

1. Workflow Validation Dataset (Efficiency Benchmark)

This subset comprises 54 insect specimens digitized using two distinct workflows to benchmark operational efficiency:

  1. Entomoscope 2.0: A low-cost, open-source platform using the AI-integrated ENIMAS 2.0 software.

  2. Keyence VHX-7000: A high-end commercial digital microscopy system using a manual workflow.

2. YOLO-Fast Training Dataset:

  • Images and Labels: A custom dataset of 257 manually annotated insect images. This data was used to fine-tune the YOLOv8 object detection model, which powers the "YOLO-Fast" automated specimen cropping method described in the study.

Contents:

  • Raw Images: Original captures from both systems.

  • Processed Images: Output of AI cropping, background removal, and uniform background generation.

  • Morphometric Data: Automated OBB measurements (Entomoscope) vs. Manual measurements (Keyence).

  • Time Logs: Detailed timing data used to calculate the 2.28-fold efficiency speedup reported in the paper.

  • YOLO-Fast Training Data: The 257 original images and their corresponding bounding box labels used for model training.

This data is provided to ensure full reproducibility of the study's results and to support the training of future AI models for insect digitization.

 

 

 

Files

Entomoscope2.0.zip

Files (6.4 GB)

Name Size Download all
md5:fee183fcf2dac3163edc63481d129c20
3.8 GB Preview Download
md5:ff2f82d6525cfb84bb42914d4d8d903e
188.9 MB Preview Download
md5:c2d454ad184435a0d703f5e6875660b2
13.6 kB Download
md5:590ac6e4f10eda5d43659c20220d053c
2.4 GB Preview Download