Published May 8, 2026 | Version v1
Dataset Open

Results from a survey of major biological databases on the availability of metadata fields relevant to Digital Sequence Information (DSI)

  • 1. German Federation for Biological Data (GFBio e.V.)
  • 2. Leibniz Institute DSMZ – German Collection of Microorganisms and Cell Cultures
  • 3. Gesellschaft fuer Biologische Daten e.V. GFBio
  • 4. National Genomics Data Center
  • 5. Beijing Institute of Genomics
  • 6. National Institute of Genetics
  • 7. Tim Hirsch Consulting
  • 8. Universiteit Leiden Instituut Biologie Leiden
  • 9. Naturalis Biodiversity Center
  • 10. Alliance Bioversity International - CIAT
  • 11. CGIAR
  • 12. Leibniz Institute of Plant Genetics and Crop Plant Research
  • 13. Leibniz Institute of Plant Genetics and Crop Plant Research (IPK)
  • 14. Heinrich Heine University Düsseldorf
  • 15. Forschungszentrum Jülich
  • 16. European Molecular Biology Laboratory
  • 17. European Bioinformatics Institute
  • 18. Office of the Chief Science Advisor Canada
  • 19. University of Victoria
  • 20. Leibniz Institute, German Collection of Microorganisms and Cell Cultures

Description

This README accompanies Supplementary Table S1 from the FAR-DSI project (July 2023), which was published with the peer-reviewed paper Raposo et al. 2026. The table presents the results of a database survey that assessed whether major biological databases provide metadata that could support the development of a multilateral benefit-sharing mechanism for DSI.

The analysis covered two main groups of databases:

  • Global Core Biodata Resources (GCBRs) identified by the Global Biodata Coalition (https://globalbiodata.org/what-we-do/global-core-biodata-resources/).
  • Community-led databases maintained by members of the German National Research Data Infrastructure (NFDI).

Databases were first screened for alignment with the Convention on Biological Diversity (CBD); those restricted to model organisms or human genetic data were excluded. For 18 databases within scope, we examined randomly selected datasets to assess which metadata fields were available, in particular the geographical location of collection (geographical origin), and recorded the types of DSI hosted (DNA/RNA, proteins/epigenetics, or metabolites/macromolecules).
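The availability check described above can be illustrated with a minimal Python sketch. It is not the procedure used in the survey: the field names (`country`, `lat_lon`, `collection_date`), the placeholder values treated as empty, the sample size, and the toy records are all assumptions made for illustration. The fields actually assessed are those listed in Supplementary Table S1.

```python
import random
from collections import Counter

# Hypothetical field names, chosen for illustration only.
FIELDS = ("country", "lat_lon", "collection_date")

# Assumed placeholder values that count as "no information".
EMPTY_VALUES = (None, "", "missing", "not provided")


def field_availability(records, fields=FIELDS, sample_size=10, seed=0):
    """Return, per field, the fraction of randomly sampled records
    that carry a non-empty value for that field."""
    rng = random.Random(seed)  # fixed seed so the sample is reproducible
    sample = rng.sample(records, min(sample_size, len(records)))
    if not sample:
        return {field: 0.0 for field in fields}
    counts = Counter()
    for record in sample:
        for field in fields:
            if record.get(field) not in EMPTY_VALUES:
                counts[field] += 1
    return {field: counts[field] / len(sample) for field in fields}


# Toy records, invented for this example.
records = [
    {"country": "Kenya", "lat_lon": "1.29 S 36.82 E"},
    {"country": "", "lat_lon": None},
    {"country": "Brazil"},
    {"country": "missing", "collection_date": "2019-04-02"},
]

availability = field_availability(records, sample_size=4)
print(availability)
```

Treating placeholder strings such as "missing" as empty is a design choice: a field that is present but uninformative does not help establish geographical origin.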

Additional details

Dates

Available
2026-05