Journal article Open Access

DrugProt corpus genes and proteins annotation guidelines [GPRO - Biocreative 5.2]

Rabal, Obdulia; Krallinger, Martin


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nam##2200000uu#4500</leader>
  <datafield tag="041" ind1=" " ind2=" ">
    <subfield code="a">eng</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">NLP</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">biomedical NLP</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">biocreative</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">NER</subfield>
  </datafield>
  <controlfield tag="005">20221028113558.0</controlfield>
  <controlfield tag="001">4957577</controlfield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Barcelona Supercomputing Center</subfield>
    <subfield code="0">(orcid)0000-0002-2646-8782</subfield>
    <subfield code="a">Krallinger, Martin</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">274734</subfield>
    <subfield code="z">md5:f28e78d398207ef610affeb7efabe800</subfield>
    <subfield code="u">https://zenodo.org/record/4957577/files/GPRO_guidelines.pdf</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2021-06-15</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire</subfield>
    <subfield code="p">user-medicalnlp</subfield>
    <subfield code="o">oai:zenodo.org:4957577</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">Barcelona Supercomputing Center</subfield>
    <subfield code="a">Rabal, Obdulia</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">DrugProt corpus genes and proteins annotation guidelines [GPRO - Biocreative 5.2]</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-medicalnlp</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">https://creativecommons.org/licenses/by/4.0/legalcode</subfield>
    <subfield code="a">Creative Commons Attribution 4.0 International</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;Annotation guidelines&amp;nbsp;used for the&amp;nbsp;annotations of gene and protein-related objects of the CHEMDNER, ChemProt and DrugProt&amp;nbsp;corpora.&lt;/p&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Please cite if you use any DrugProt resource:&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Miranda, Antonio, et al. &amp;quot;Overview of DrugProt BioCreative VII track: quality evaluation and large scale text mining of drug-gene/protein relations.&amp;quot;&amp;nbsp;&lt;em&gt;Proceedings of the seventh BioCreative challenge evaluation workshop&lt;/em&gt;. 2021.&lt;/p&gt;

&lt;pre&gt;&lt;code&gt;@inproceedings{miranda2021overview,
  title={Overview of DrugProt BioCreative VII track: quality evaluation and large scale text mining of drug-gene/protein relations},
  author={Miranda, Antonio and Mehryary, Farrokh and Luoma, Jouni and Pyysalo, Sampo and Valencia, Alfonso and Krallinger, Martin},
  booktitle={Proceedings of the seventh BioCreative challenge evaluation workshop},
  year={2021}
}&lt;/code&gt;&lt;/pre&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Introduction&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The&amp;nbsp;annotation guidelines have been refined after iterative cycles of annotations of sample documents. It also incorporated suggestions made by curators as well as observations of annotation inconsistencies encountered when comparing results from different human curators.&lt;/p&gt;

&lt;p&gt;In brief, the annotated GPROs include genes, gene products&lt;br&gt;
(proteins, RNA), DNA/protein sequence elements and protein families, domains and complexes. The aim of the iterative manual annotation cycles was to improve the quality and consistency of the guidelines, in order to make them more intuitive and easier to follow.&lt;/p&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;

&lt;p&gt;Please, cite:&lt;/p&gt;

&lt;p&gt;@article{perez2016markyt,&amp;nbsp;title={The Markyt visualisation, prediction and benchmark platform for chemical and gene entity recognition at BioCreative/CHEMDNER challenge},&amp;nbsp;author={P{\&amp;#39;e}rez-P{\&amp;#39;e}rez, Martin and P{\&amp;#39;e}rez-Rodr{\&amp;#39;\i}guez, Gael and Rabal, Obdulia and Vazquez, Miguel and Oyarzabal, Julen and Fdez-Riverola, Florentino and Valencia, Alfonso and Krallinger, Martin and Louren{\c{c}}o, An{\&amp;#39;a}lia},&amp;nbsp;journal={Database},&amp;nbsp;volume={2016},&amp;nbsp;year={2016},&amp;nbsp;publisher={Oxford Academic}}&lt;/p&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Related Resources:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;&lt;a href="https://biocreative.bioinformatics.udel.edu/tasks/biocreative-vii/track-1/"&gt;Web&lt;/a&gt;&lt;/li&gt;
	&lt;li&gt;&lt;a href="https://doi.org/10.5281/zenodo.4955410"&gt;DrugProt corpus&lt;/a&gt;&lt;/li&gt;
	&lt;li&gt;&lt;a href="https://github.com/tonifuc3m/drugprot-evaluation-library"&gt;Evaluation library&lt;/a&gt;&lt;/li&gt;
	&lt;li&gt;&lt;a href="https://codalab.lisn.upsaclay.fr/competitions/8293"&gt;Online evaluation (CodaLab)&lt;/a&gt;&lt;/li&gt;
	&lt;li&gt;&lt;a href="https://doi.org/10.5281/zenodo.4957137"&gt;Relation annotation guidelines&lt;/a&gt;&lt;/li&gt;
	&lt;li&gt;&lt;a href="https://doi.org/10.5281/zenodo.4957576"&gt;Gene and protein annotation guidelines&lt;/a&gt;&lt;/li&gt;
	&lt;li&gt;&lt;a href="https://doi.org/10.5281/zenodo.4957518"&gt;Chemicals and drugs annotation guidelines&lt;/a&gt;&lt;/li&gt;
	&lt;li&gt;&lt;a href="https://doi.org/10.5281/zenodo.7252201"&gt;DrugProt Silver Standard Knowledge Graph&lt;/a&gt;&lt;/li&gt;
	&lt;li&gt;&lt;a href="https://doi.org/10.5281/zenodo.5042178"&gt;FAQ&lt;/a&gt;&lt;/li&gt;
	&lt;li&gt;&lt;a href="https://doi.org/10.5281/zenodo.5119878"&gt;DrugProt Large Scale Additional SubTrack&lt;/a&gt;&lt;/li&gt;
	&lt;li&gt;&lt;a href="https://doi.org/10.5281/zenodo.5656991"&gt;DrugProt Large Scale document collection protocol&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.4957576</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.4957577</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">publication</subfield>
    <subfield code="b">article</subfield>
  </datafield>
</record>
411
332
views
downloads
All versions This version
Views 411411
Downloads 332332
Data volume 91.2 MB91.2 MB
Unique views 379379
Unique downloads 314314

Share

Cite as