There is a newer version of this record available.

Preprint Open Access

10 Simple rules for design, provision, and reuse of persistent identifiers for life science data

McMurry, Julie; Blomberg, Niklas; Burdett, Tony; Conte, Nathalie; Dumontier, Michel; Fellows, Donal K; Gonzalez-Beltran, Alejandra; Gormanns, Philipp; Hastings, Janna; Haendel, Melissa A; Hermjakob, Henning; Hériché, Jean-Karim; Ison, Jon C; Jimenez, Rafael C; Jupp, Simon; Juty, Nick; Laibe, Camille; Le Novère, Nicolas; Malone, James; Martin, Maria J; McEntyre, Johanna R; Morris, Chris; Muilu, Juha; Müller, Wolfgang; Mungall, Christopher J; Rocca-Serra, Philippe; Sansone, Susanna-Assunta; Sariyar, Murat; Snoep, Jacky L; Stanford, Natalie J; Swainston, Neil; Washington, Nicole; Williams, Alan R; Wolstencroft, Katherine; Goble, Carole; Parkinson, Helen


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nam##2200000uu#4500</leader>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Identifiers</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Identifier design</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Reproducibility</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">e-Science</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Big data</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Accessions</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Databases</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Interoperability</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Synthesis research</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Standards</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Open science</subfield>
  </datafield>
  <controlfield tag="005">20200120174336.0</controlfield>
  <datafield tag="500" ind1=" " ind2=" ">
    <subfield code="a">ORCIDs corresponding to the authors are:
http://orcid.org/0000-0002-9353-5498
http://orcid.org/0000-0003-4155-5910
http://orcid.org/0000-0002-2513-5396
http://orcid.org/0000-0002-1010-3121
http://orcid.org/0000-0003-4727-9435
http://orcid.org/0000-0002-9091-5938
http://orcid.org/0000-0003-3499-8262
http://orcid.org/0000-0001-9823-1621
http://orcid.org/0000-0002-3469-4923
http://orcid.org/0000-0001-9114-8737
http://orcid.org/0000-0001-8479-0262
http://orcid.org/0000-0001-6867-9425
http://orcid.org/0000-0001-6666-1520
http://orcid.org/0000-0001-5404-7670
http://orcid.org/0000-0002-0643-3144
http://orcid.org/0000-0002-2036-8350
http://orcid.org/0000-0002-4625-743X
http://orcid.org/0000-0002-6309-7327
http://orcid.org/0000-0002-1615-2899
http://orcid.org/0000-0001-5454-2815
http://orcid.org/0000-0002-1611-6935
http://orcid.org/0000-0002-9533-5684
http://orcid.org/0000-0002-1034-5171
http://orcid.org/0000-0002-4980-3512
http://orcid.org/0000-0002-6601-2165
http://orcid.org/0000-0001-9853-5668
http://orcid.org/0000-0001-5306-5690
http://orcid.org/0000-0002-5595-689X
http://orcid.org/0000-0002-0405-8854
http://orcid.org/0000-0003-4958-0184
http://orcid.org/0000-0001-7020-1236
http://orcid.org/0000-0001-8936-9143
http://orcid.org/0000-0003-3156-2105
http://orcid.org/0000-0002-1279-5133
http://orcid.org/0000-0003-1219-2137
http://orcid.org/0000-0003-3035-4195</subfield>
  </datafield>
  <controlfield tag="001">18003</controlfield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">ELIXIR Hub, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom</subfield>
    <subfield code="a">Blomberg, Niklas</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom</subfield>
    <subfield code="a">Burdett, Tony</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom</subfield>
    <subfield code="a">Conte, Nathalie</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Center for Biomedical Informatics Research, Stanford University, Stanford, California, USA</subfield>
    <subfield code="a">Dumontier, Michel</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">School of Computer Science, The University of Manchester, Manchester, United Kingdom</subfield>
    <subfield code="a">Fellows, Donal K</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Oxford e-Research Centre, University of Oxford, Oxford, United Kingdom</subfield>
    <subfield code="a">Gonzalez-Beltran, Alejandra</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Institute of Experimental Genetics, Helmholtz Centre Munich -German Research Center for Environmental Health (GmbH), Neuherberg, Germany </subfield>
    <subfield code="a">Gormanns, Philipp</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom</subfield>
    <subfield code="a">Hastings, Janna</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Department of Medical Informatics and Epidemiology and OHSU Library, Oregon Health &amp; Science University, Portland, USA.</subfield>
    <subfield code="a">Haendel, Melissa A</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom</subfield>
    <subfield code="a">Hermjakob, Henning</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">European Molecular Biology Laboratory, Heidelberg, Germany</subfield>
    <subfield code="a">Hériché, Jean-Karim</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, Lyngby, Denmark</subfield>
    <subfield code="a">Ison, Jon C</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">ELIXIR Hub, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom</subfield>
    <subfield code="a">Jimenez, Rafael C</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom</subfield>
    <subfield code="a">Jupp, Simon</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom</subfield>
    <subfield code="a">Juty, Nick</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom</subfield>
    <subfield code="a">Laibe, Camille</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom | Babraham Institute, Cambridge, United Kingdom</subfield>
    <subfield code="a">Le Novère, Nicolas</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom</subfield>
    <subfield code="a">Malone, James</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom</subfield>
    <subfield code="a">Martin, Maria J</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom</subfield>
    <subfield code="a">McEntyre, Johanna R</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">STFC, Daresbury Laboratory, Warrington, United Kingdom</subfield>
    <subfield code="a">Morris, Chris</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Genomics Coordination Center, Department of Genetics, University Medical Center Groningen and Groningen Bioinformatics Center, University of Groningen, Groningen, Netherlands</subfield>
    <subfield code="a">Muilu, Juha</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">SDBV, HITS, Heidelberg, Germany</subfield>
    <subfield code="a">Müller, Wolfgang</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA</subfield>
    <subfield code="a">Mungall, Christopher J</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Oxford e-Research Centre, University of Oxford, Oxford, United Kingdom</subfield>
    <subfield code="a">Rocca-Serra, Philippe</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Oxford e-Research Centre, University of Oxford, Oxford, United Kingdom</subfield>
    <subfield code="a">Sansone, Susanna-Assunta</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Institute of Pathology, Charite – University Medicine Berlin, Berlin, Germany | TMF – Technologie- und Methodenplattform e. V. Berlin, Germany</subfield>
    <subfield code="a">Sariyar, Murat</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">MIB, University of Manchester, Manchester, UK | Department of Biochemistry, Stellenbosch University, Stellenbosch, South Africa</subfield>
    <subfield code="a">Snoep, Jacky L</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">School of Computer Science, The University of Manchester, Manchester, United Kingdom</subfield>
    <subfield code="a">Stanford, Natalie J</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Manchester Centre for Synthetic Biology of Fine and Speciality Chemicals (SYNBIOCHEM), University of Manchester, Manchester, UK.</subfield>
    <subfield code="a">Swainston, Neil</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA</subfield>
    <subfield code="a">Washington, Nicole</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">School of Computer Science, The University of Manchester, Manchester, United Kingdom</subfield>
    <subfield code="a">Williams, Alan R</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Leiden Institute of Advanced Computer Science, Leiden University, Leiden, Netherlands</subfield>
    <subfield code="a">Wolstencroft, Katherine</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">School of Computer Science, The University of Manchester, Manchester, United Kingdom</subfield>
    <subfield code="a">Goble, Carole</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom</subfield>
    <subfield code="a">Parkinson, Helen</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">1111169</subfield>
    <subfield code="z">md5:09f2036e95d7f259bc75ac3e0af75c08</subfield>
    <subfield code="u">https://zenodo.org/record/18003/files/MS_2015-05-23.pdf</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2015-05-26</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire</subfield>
    <subfield code="o">oai:zenodo.org:18003</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom</subfield>
    <subfield code="a">McMurry, Julie</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">10 Simple rules for design, provision, and reuse of persistent identifiers for life science data</subfield>
  </datafield>
  <datafield tag="536" ind1=" " ind2=" ">
    <subfield code="c">284209</subfield>
    <subfield code="a">Building data bridges between biological and medical infrastructures in Europe</subfield>
  </datafield>
  <datafield tag="536" ind1=" " ind2=" ">
    <subfield code="c">601043</subfield>
    <subfield code="a">DIACHRON – Managing the Evolution and Preservation of the Data Web</subfield>
  </datafield>
  <datafield tag="536" ind1=" " ind2=" ">
    <subfield code="c">312455</subfield>
    <subfield code="a">Infrastructure for Systems Biology - Europe</subfield>
  </datafield>
  <datafield tag="536" ind1=" " ind2=" ">
    <subfield code="c">211601</subfield>
    <subfield code="a">European Life-science Infrastructure for Biological Information</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">https://creativecommons.org/licenses/by/4.0/legalcode</subfield>
    <subfield code="a">Creative Commons Attribution 4.0 International</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;In the life sciences, problems with identifiers impede the flow and integrity of information. This is especially challenging within &amp;ldquo;synthesis research&amp;rdquo; disciplines such as systems biology, translational medicine, and ecology. Implementation-driven initiatives such as ELIXIR, BD2K, and others have therefore been actively working to understand and address underlying problems with identifiers.&amp;nbsp;&lt;/p&gt;

&lt;p&gt;Good, global-scale, persistent identifier design is harder than it appears, and is essential for data to be Findable, Accessible, Interoperable, and Reusable (Data FAIRport principles). Here, we build on emerging conventions and existing general recommendations &amp;nbsp;and summarise the identifier characteristics most important to optimising the utility of life-science data. We propose actions to take in the identifier &amp;lsquo;green field&amp;rsquo; and offer guidance for using real-world identifiers from diverse sources.&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isPreviousVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.31765</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.610288</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.18003</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">publication</subfield>
    <subfield code="b">preprint</subfield>
  </datafield>
</record>
1,451
201
views
downloads
All versions This version
Views 1,451763
Downloads 201138
Data volume 158.8 MB153.3 MB
Unique views 1,383746
Unique downloads 180131

Share

Cite as