Published August 26, 2025 | Version v1
Publication Open

GREI Metadata Recommendations from DataCite schema version 4.6

  • 1. Digital Science (United Kingdom)
  • 2. University of California Office of the President
  • 3. Vivli
  • 4. European Organization for Nuclear Research
  • 5. Harvard University Institute for Quantitative Social Science
  • 6. Dryad Digital Repository
  • 7. Center for Open Science
  • 8. RELX Group (United States)

Description

This document, version 2 of the GREI Metadata Recommendations, provides updated guidelines for generalist data repositories to standardize their metadata using DataCite Metadata Schema 4.6. The update also incorporates community feedback received on version 1. The goal is to enhance the interoperability and discoverability of datasets, particularly those from NIH-funded projects. The recommendations strongly encourage repositories to collect a specific subset of metadata properties and to use designated vocabularies and values, in support of common use cases for sharing, discovering, and tracking the impact of data.

The core of the recommendations is a table listing the DataCite metadata properties that the GREI repositories agree are most essential to FAIR data sharing, with recommended values and vocabularies for each field where applicable. Key recommendations include using DataCite-generated DOIs for identifiers, ORCID iDs for personal name identifiers, and ROR IDs for organizational identifiers and funder information. The table also specifies the value Dataset for resourceTypeGeneral and the dateType Issued for the publication date.
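To make the key recommendations above concrete, here is a minimal sketch of what a conforming record might look like, plus a small checker. The key names follow DataCite's JSON representation as commonly used and should be verified against the schema 4.6 documentation. All values (the DOI, the ROR IDs, the organization names) are hypothetical placeholders, and `check_recommendations` is an illustrative helper, not part of the GREI document or any DataCite tooling.

```python
# Illustrative only: placeholder values, not a real dataset record.
example_record = {
    "doi": "10.1234/example-dataset",  # hypothetical DOI
    "titles": [{"title": "Example dataset"}],
    "creators": [{
        "name": "Carberry, Josiah",
        "nameType": "Personal",
        "nameIdentifiers": [{
            "nameIdentifier": "https://orcid.org/0000-0002-1825-0097",
            "nameIdentifierScheme": "ORCID",
        }],
        "affiliation": [{
            "name": "Example University",
            "affiliationIdentifier": "https://ror.org/00example0",  # placeholder
            "affiliationIdentifierScheme": "ROR",
        }],
    }],
    "types": {"resourceTypeGeneral": "Dataset"},
    "dates": [{"date": "2025-08-26", "dateType": "Issued"}],
    "fundingReferences": [{
        "funderName": "Example Funder",
        "funderIdentifier": "https://ror.org/00example1",  # placeholder
        "funderIdentifierType": "ROR",
    }],
}


def check_recommendations(record):
    """Return a list of human-readable gaps against the key recommendations."""
    problems = []
    if record.get("types", {}).get("resourceTypeGeneral") != "Dataset":
        problems.append("resourceTypeGeneral is not Dataset")
    if not any(d.get("dateType") == "Issued" for d in record.get("dates", [])):
        problems.append("no date with dateType Issued")
    for creator in record.get("creators", []):
        if creator.get("nameType") != "Personal":
            continue  # ORCID iDs apply to people, not organizations
        schemes = {i.get("nameIdentifierScheme")
                   for i in creator.get("nameIdentifiers", [])}
        if "ORCID" not in schemes:
            problems.append(f"creator {creator.get('name')} has no ORCID iD")
    for funder in record.get("fundingReferences", []):
        if funder.get("funderIdentifierType") != "ROR":
            problems.append(f"funder {funder.get('funderName')} has no ROR ID")
    return problems


print(check_recommendations(example_record))
```

A repository could run a check like this at deposit time to prompt depositors for missing identifiers. The full recommendation table covers more properties than this sketch tests.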

Additional subsections of the recommendation include:

  • GREI Recommended Subset of Relation Types: This section outlines a subset of DataCite's relation types that are most useful for common interactions, providing both the DataCite relationType and a researcher-friendly label.

  • Recommendations for Controlled Vocabulary Integration in Generalist Repositories: This section recommends using controlled vocabularies and lists quality criteria for selecting them, such as governance, coverage, and persistence.

  • Proposed CRediT Role Subset for Generalist Data Repositories: This section proposes a subset of the official NISO CRediT roles that are most applicable to the lifecycle of dataset generation, curation, and publication.

  • ORCID Recommendations: This section provides best practices for collecting, authenticating, and pushing ORCID iDs to improve attribution and metadata quality.

  • Recommended Usage of Dates in GREI Metadata: This section provides guidance on using DataCite's Date and dateType fields, emphasizing that Issued must be used for the publication date and that other dates can provide additional context.
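As one illustration of the ORCID bullet above, a repository that accepts typed iDs can at least verify the check character before storing them. The sketch below implements the ISO 7064 MOD 11-2 checksum that ORCID documents for its identifiers. Whether the GREI recommendations prescribe this particular check is an assumption, and the function names are hypothetical. A format check is not a substitute for authentication, which goes through ORCID's own sign-in flow and is not shown here.

```python
import re

# Four hyphen-separated groups of four; the last character may be X.
ORCID_PATTERN = re.compile(r"^(\d{4})-(\d{4})-(\d{4})-(\d{3}[\dX])$")
ORCID_URL_PREFIX = "https://orcid.org/"


def orcid_check_character(base_digits):
    """Compute the ISO 7064 MOD 11-2 check character for 15 base digits."""
    total = 0
    for ch in base_digits:
        total = (total + int(ch)) * 2
    result = (12 - total % 11) % 11
    return "X" if result == 10 else str(result)


def is_valid_orcid(orcid):
    """Check the format and checksum of a bare or URL-form ORCID iD."""
    if orcid.startswith(ORCID_URL_PREFIX):
        orcid = orcid[len(ORCID_URL_PREFIX):]
    match = ORCID_PATTERN.match(orcid)
    if match is None:
        return False
    compact = "".join(match.groups())  # 16 characters, hyphens removed
    return orcid_check_character(compact[:15]) == compact[15]


print(is_valid_orcid("0000-0002-1825-0097"))
```

This catches typos in manually entered iDs but cannot confirm that the iD belongs to the depositor.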

Files

GREI_Metadata_RecommendationsV02.pdf

Files (483.6 kB)

md5:612c4ec869eebcd2a7829f21d819e1a2
483.6 kB

Additional details

Related works

Cites
Standard: 10.14454/csba-e454 (DOI)
Is new version of
Publication: 10.5281/zenodo.8101956 (DOI)
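
The two related works above can be sketched as DataCite relatedIdentifiers entries, pairing each relationType with a researcher-friendly label in the spirit of the recommended relation-type subset. The key names follow DataCite's JSON representation and should be checked against the schema. The `LABELS` mapping and `describe` helper are hypothetical, and the resource type of the earlier version is omitted because the label shown on this page ("Publication") is not assumed to be a DataCite resourceTypeGeneral value.

```python
# Related works from this record, expressed as DataCite-style entries.
related_identifiers = [
    {
        "relatedIdentifier": "10.14454/csba-e454",
        "relatedIdentifierType": "DOI",
        "relationType": "Cites",
        "resourceTypeGeneral": "Standard",
    },
    {
        "relatedIdentifier": "10.5281/zenodo.8101956",
        "relatedIdentifierType": "DOI",
        "relationType": "IsNewVersionOf",
    },
]

# Hypothetical mapping from relationType to a display label.
LABELS = {
    "Cites": "Cites",
    "IsNewVersionOf": "Is new version of",
}


def describe(entry):
    """Render one relatedIdentifiers entry as a short display string."""
    label = LABELS.get(entry["relationType"], entry["relationType"])
    return (f"{label}: {entry['relatedIdentifier']} "
            f"({entry['relatedIdentifierType']})")


for entry in related_identifiers:
    print(describe(entry))
```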

Funding

National Institutes of Health
Advancing Figshare and the generalist repository landscape to meet research community needs 3OT2DB000006-01S1
National Institutes of Health
Center for Open Science (COS) Proposal for the NIH Generalist Repository Ecosystem Initiative (GREI) 3OT2DB000001-01S1
National Institutes of Health
THE GENERALIST REPOSITORY ECOSYSTEM INITIATIVE (GREI) 1OT2DB000002-01
National Institutes of Health
Zenodo and the Generalist Repository Ecosystem Initiative (GREI) 1OT2DB000013-01
National Institutes of Health
Dryad and the Generalist Repository Ecosystem Initiative (GREI) 3OT2DB000005-01S1
National Institutes of Health
Vivli: A Generalist Repository For Clinical Trials Data 3OT2DB000003-01S1
National Institutes of Health
The Harvard Dataverse repository: A generalist repository integrated with a Data Commons 3OT2DB000004-01S1