10.5281/zenodo.18003
https://zenodo.org/records/18003
oai:zenodo.org:18003
McMurry, Julie
Julie
McMurry
European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
Blomberg, Niklas
Niklas
Blomberg
ELIXIR Hub, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
Burdett, Tony
Tony
Burdett
European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
Conte, Nathalie
Nathalie
Conte
European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
Dumontier, Michel
Michel
Dumontier
Center for Biomedical Informatics Research, Stanford University, Stanford, California, USA
Fellows, Donal K
Donal K
Fellows
School of Computer Science, The University of Manchester, Manchester, United Kingdom
Gonzalez-Beltran, Alejandra
Alejandra
Gonzalez-Beltran
Oxford e-Research Centre, University of Oxford, Oxford, United Kingdom
Gormanns, Philipp
Philipp
Gormanns
Institute of Experimental Genetics, Helmholtz Centre Munich -German Research Center for Environmental Health (GmbH), Neuherberg, Germany
Hastings, Janna
Janna
Hastings
European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
Haendel, Melissa A
Melissa A
Haendel
Department of Medical Informatics and Epidemiology and OHSU Library, Oregon Health & Science University, Portland, USA.
Hermjakob, Henning
Henning
Hermjakob
European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
Hériché, Jean-Karim
Jean-Karim
Hériché
European Molecular Biology Laboratory, Heidelberg, Germany
Ison, Jon C
Jon C
Ison
Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, Lyngby, Denmark
Jimenez, Rafael C
Rafael C
Jimenez
ELIXIR Hub, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
Jupp, Simon
Simon
Jupp
European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
Juty, Nick
Nick
Juty
European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
Laibe, Camille
Camille
Laibe
European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
Le Novère, Nicolas
Nicolas
Le Novère
European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom | Babraham Institute, Cambridge, United Kingdom
Malone, James
James
Malone
European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
Martin, Maria J
Maria J
Martin
European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
McEntyre, Johanna R
Johanna R
McEntyre
European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
Morris, Chris
Chris
Morris
STFC, Daresbury Laboratory, Warrington, United Kingdom
Muilu, Juha
Juha
Muilu
Genomics Coordination Center, Department of Genetics, University Medical Center Groningen and Groningen Bioinformatics Center, University of Groningen, Groningen, Netherlands
Müller, Wolfgang
Wolfgang
Müller
SDBV, HITS, Heidelberg, Germany
Mungall, Christopher J
Christopher J
Mungall
Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Rocca-Serra, Philippe
Philippe
Rocca-Serra
Oxford e-Research Centre, University of Oxford, Oxford, United Kingdom
Sansone, Susanna-Assunta
Susanna-Assunta
Sansone
Oxford e-Research Centre, University of Oxford, Oxford, United Kingdom
Sariyar, Murat
Murat
Sariyar
Institute of Pathology, Charite – University Medicine Berlin, Berlin, Germany | TMF – Technologie- und Methodenplattform e. V. Berlin, Germany
Snoep, Jacky L
Jacky L
Snoep
MIB, University of Manchester, Manchester, UK | Department of Biochemistry, Stellenbosch University, Stellenbosch, South Africa
Stanford, Natalie J
Natalie J
Stanford
School of Computer Science, The University of Manchester, Manchester, United Kingdom
Swainston, Neil
Neil
Swainston
Manchester Centre for Synthetic Biology of Fine and Speciality Chemicals (SYNBIOCHEM), University of Manchester, Manchester, UK.
Washington, Nicole
Nicole
Washington
Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Williams, Alan R
Alan R
Williams
School of Computer Science, The University of Manchester, Manchester, United Kingdom
Wolstencroft, Katherine
Katherine
Wolstencroft
Leiden Institute of Advanced Computer Science, Leiden University, Leiden, Netherlands
Goble, Carole
Carole
Goble
School of Computer Science, The University of Manchester, Manchester, United Kingdom
Parkinson, Helen
Helen
Parkinson
European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
10 Simple rules for design, provision, and reuse of persistent identifiers for life science data
Zenodo
2015
Identifiers
Identifier design
Reproducibility
e-Science
Big data
Accessions
Databases
Interoperability
Synthesis research
Standards
Open science
2015-05-26
10.5281/zenodo.31765
10.5281/zenodo.610288
https://zenodo.org/communities/eu
Creative Commons Attribution 4.0 International
In the life sciences, problems with identifiers impede the flow and integrity of information. This is especially challenging within “synthesis research” disciplines such as systems biology, translational medicine, and ecology. Implementation-driven initiatives such as ELIXIR, BD2K, and others have therefore been actively working to understand and address underlying problems with identifiers.
Good, global-scale, persistent identifier design is harder than it appears, and is essential for data to be Findable, Accessible, Interoperable, and Reusable (Data FAIRport principles). Here, we build on emerging conventions and existing general recommendations and summarise the identifier characteristics most important to optimising the utility of life-science data. We propose actions to take in the identifier ‘green field’ and offer guidance for using real-world identifiers from diverse sources.
ORCIDs corresponding to the authors are:
http://orcid.org/0000-0002-9353-5498
http://orcid.org/0000-0003-4155-5910
http://orcid.org/0000-0002-2513-5396
http://orcid.org/0000-0002-1010-3121
http://orcid.org/0000-0003-4727-9435
http://orcid.org/0000-0002-9091-5938
http://orcid.org/0000-0003-3499-8262
http://orcid.org/0000-0001-9823-1621
http://orcid.org/0000-0002-3469-4923
http://orcid.org/0000-0001-9114-8737
http://orcid.org/0000-0001-8479-0262
http://orcid.org/0000-0001-6867-9425
http://orcid.org/0000-0001-6666-1520
http://orcid.org/0000-0001-5404-7670
http://orcid.org/0000-0002-0643-3144
http://orcid.org/0000-0002-2036-8350
http://orcid.org/0000-0002-4625-743X
http://orcid.org/0000-0002-6309-7327
http://orcid.org/0000-0002-1615-2899
http://orcid.org/0000-0001-5454-2815
http://orcid.org/0000-0002-1611-6935
http://orcid.org/0000-0002-9533-5684
http://orcid.org/0000-0002-1034-5171
http://orcid.org/0000-0002-4980-3512
http://orcid.org/0000-0002-6601-2165
http://orcid.org/0000-0001-9853-5668
http://orcid.org/0000-0001-5306-5690
http://orcid.org/0000-0002-5595-689X
http://orcid.org/0000-0002-0405-8854
http://orcid.org/0000-0003-4958-0184
http://orcid.org/0000-0001-7020-1236
http://orcid.org/0000-0001-8936-9143
http://orcid.org/0000-0003-3156-2105
http://orcid.org/0000-0002-1279-5133
http://orcid.org/0000-0003-1219-2137
http://orcid.org/0000-0003-3035-4195
European Commission
10.13039/501100000780
284209
Building data bridges between biological and medical infrastructures in Europe
European Commission
10.13039/501100000780
601043
DIACHRON – Managing the Evolution and Preservation of the Data Web
European Commission
10.13039/501100000780
312455
Infrastructure for Systems Biology - Europe
European Commission
10.13039/501100000780
211601
European Life-science Infrastructure for Biological Information