Published April 6, 2026 | Version v1
Technical note Open

From Data Model to Interoperable Metadata: Implementing the H2IOSC Resource Description Schema across a Multi-Infrastructure Pipeline

  • 1. ROR icon Istituto per il Lessico Intellettuale Europeo e la Storia delle Idee
  • 2. EDMO icon National Research Council
  • 3. Università degli Studi di Torino

Description

This paper describes the implementation of the H2IOSC resource description schema across a complete metadata pipeline, from user input to OAI-PMH protocol output. The H2IOSC project federates four European research infrastructures (OPERAS, E-RIHS, CLARIN, DARIAH) and requires a unified data model capable of describing heterogeneous scholarly resources – tools, datasets, publications, projects, services, terminology resources, learning resources, and workflows – while preserving infrastructure-specific semantics. We present the data model design, its concrete realization across four system layers (HTML form, JSON storage, native OAI-PMH XML, Dublin Core XML), and the mapping strategies employed at each transformation step. The model supports multilingual metadata, controlled vocabularies, structured identity management with role-based actor associations, and a property system that enforces type-specific constraints. We document the design decisions, the trade-offs between normalization and protocol compliance, and the validation results. The implementation is operational and serves metadata from multiple infrastructures through a validated OAI-PMH 2.0 endpoint.

Files

From Data Model to Interoperable Metadata- Implementing the H2IOSC Resource Description Schema across a Multi-Infrastructure Pipeline.pdf

Additional details

Related works

Is original form of
Technical note: 10.5281/zenodo.19409795 (DOI)
Is supplement to
Technical note: 10.5281/zenodo.19353859 (DOI)
Technical note: 10.5281/zenodo.14187534 (DOI)

References