Digitisation infrastructure design for Zenodo. Deliverable D6.3
Nielsen, Lars Holm;
This deliverable specifies the requirements for adapting CERN’s Zenodo services for long-term storage of large-scale digitised biodiversity data. The report describes the service(s) features, capacities, functions and its suitability. Recommendations of use and possible designs are also included. The report is structured in eight parts:
The context section describes the setting of the pilot, including the institutions involved (sources and services providers) and the actors within those institutions, the processes covered, and services provided.
The infrastructure section depicts the facilities supporting the integration of service providers and consumers, particularly APIs, programming languages, protocols, and speeds.
The data section provides an overview of the data model used for storing digital specimen data requiring long-term preservation, including the minimal data required for making a deposit, as well as data and metadata, which are part of each digital specimen.
The design section describes the overall architectural view of the implemented solution.
The test case section describes the upload of two data sets, a 2K of herbarium sheets with rich metadata and large digital files and a 281K of herbarium sheets with limited metadata.
The recommendation section provides practical recommendations to use the service(s) evaluated in the pilot, the benefits and trade-offs.
The references section provides the full references of bibliographic citations.