Journal article Open Access

DrugProt corpus genes and proteins annotation guidelines [GPRO - Biocreative 5.2]

Rabal, Obdulia; Krallinger, Martin

Annotation guidelines used to annotate gene and protein-related objects (GPROs) in the CHEMDNER, ChemProt and DrugProt corpora.



The annotation guidelines were refined through iterative cycles of annotating sample documents. They also incorporate suggestions made by curators, as well as observations of annotation inconsistencies encountered when comparing results from different human curators.

In brief, the annotated GPROs include genes, gene products (proteins, RNA), DNA/protein sequence elements, and protein families, domains and complexes. The aim of the iterative manual annotation cycles was to improve the quality and consistency of the guidelines and to make them more intuitive and easier to follow.


Please cite:

@article{perez2016markyt, title={The Markyt visualisation, prediction and benchmark platform for chemical and gene entity recognition at BioCreative/CHEMDNER challenge}, author={P{\'e}rez-P{\'e}rez, Martin and P{\'e}rez-Rodr{\'\i}guez, Gael and Rabal, Obdulia and Vazquez, Miguel and Oyarzabal, Julen and Fdez-Riverola, Florentino and Valencia, Alfonso and Krallinger, Martin and Louren{\c{c}}o, An{\'a}lia}, journal={Database}, volume={2016}, year={2016}, publisher={Oxford Academic}}

