There is a newer version of the record available.

Published April 27, 2024 | Version v2
Dataset Open

A machine learning framework for extracting information from biological pathway images in the literature

  • 1. ROR icon Korea Advanced Institute of Science and Technology

Description

Training and validation datasets_arrow detection.zip:
Training and validation datasets for arrow detection using Faster R-CNN model. A total of 6,471 images have been prepared, including 2,332 images from five different sources and 4,139 augmented images.

Test dataset_arrow detection.zip:
Test dataset for arrow detection using Faster R-CNN model. A total of 100 images have been prepared from 89 papers searched through PubMed Central (PMC).

EBPI outputs.txt:
Reaction information extracted using EBPI from 49,846 biological pathway images across 466 target chemicals.

Files

EBPI outputs.txt

Files (952.1 MB)

Name Size Download all
md5:5071b63b36e3cfcdd6b178d1ce679565
200.7 MB Preview Download
md5:5979ae013794eaef483aebc92904d938
11.3 MB Preview Download
md5:a85d2a5057de84a8d7630f0eda87c9f6
740.1 MB Preview Download

Additional details

Software