Report Open Access

Provisional Data Management Plan for DiSSCo infrastructure. Deliverable D6.6

Hardisty, Alex

JSON-LD Export

  "description": "<p>Executive summary</p>\n\n<p>DiSSCo, the &ldquo;Distributed System of Scientific Collections&rdquo;, is a pan\u2010European Research Infrastructure mobilising, unifying and delivering bio\u2010 and geo\u2010diversity digital information to scientific communities and beyond as a single digital virtual collection. With approximately 1.5 billion objects to be digitised, bringing natural science collections to the information age is expected to result in 100 petabytes of new data over the next two decades, used on average by 5,000 &ndash; 15,000 unique users every day.<br>\nThe DiSSCo Data Management Plan (DMP) is a living document reflecting the active data management planning and stewardship philosophy of DiSSCo, with focus on achieving maximum accessibility and reusability of data according to core principles of &#39;findable, accessible, interoperable and reusable&#39; (FAIR), longevity of data and data preservation, community curation, linking to third\u2010party information and reproducible science. The DiSSCo DMP offers unified data management principles for data providers, data managers and users, and guidance to engineers and programmers on technical standards and best practices. It applies to data management activities (production and acquisition, curation, publishing, processing and use) of the geographically distributed collection\u2010holding organisations (the DiSSCo Facilities) and to all DiSSCo Hub activities.<br>\nDiSSCo adopts Digital Object Architecture (DOA) as its foundation because of its future\u2010proof flexibility over long timescales in the face of technological change, and because DOA has been shown to offer adherence to the FAIR principles as an integral characteristic, inherently providing mechanisms that directly address the specific principles to be promoted. In DOA the core concept is the &lsquo;digital object&rsquo;.<br>\nDigitisation is the process of making data about physical objects digitally available, and the output of that process is Digital Specimens and Digital Collections. Digital Specimens and Digital Collections are specific types of &lsquo;digital objects&rsquo;, which are the fundamental entities subject to data management in DiSSCo.<br>\nEach instance of a digital object collects and organises all the core information about the physical things it represents. These identified objects are amenable to processing and to transport from one system to another, making DOA a powerful yet simple extension of the existing Internet. A link must be maintained by the Digital Specimen to the physical specimen it represents. This link is the identifier of the physical specimen. These Digital Specimen objects are the principal data that DiSSCo manages. Each Digital Specimen or other digital object instance handled by the DiSSCo infrastructure must be unambiguously, universally and persistently identified by an identifier (Natural Science Identifier, NSId) which shall be assigned when the object is first created. Each DiSSCo Facility shall be responsible for creating (minting) and managing its own NSIds in accordance with the DiSSCo policy for NSIds, and for registering its own Digital Specimens with the DiSSCo Hub infrastructure. Resolution of an NSId shall always return the current version of an object&rsquo;s content, as well as any interpretations and annotations associated with it.<br>\nThe principal object types in DiSSCo (Digital Specimens, Digital Collections) are treated as mutable objects with access control and object history (provenance), meaning that they can be updated as new knowledge becomes available. Provenance data must be generated and preserved by all operations acting upon DiSSCo data objects. Timestamped records of change (provenance data) allow reconstruction of a specific &lsquo;version&rsquo; of a digital object at a date and time in the past.<br>\nInformation about Digital Specimens and Digital Collections must be published and managed as part of the European Collection Objects Index. DiSSCo Facilities are encouraged to publish the fullest available digital data about their individual specimens and collections at the earliest opportunity, aiming as best practice to achieve at least MIDS level 2 for Digital Specimens and MICS level 2 for Digital Collections information.<br>\nSeveral characteristics, such as centrality, accuracy and authenticity of the Digital Specimen, protection of data, preservation of readability, traceability/provenance, and annotation history are essential for developing long\u2010term community trust in DiSSCo. These are the protected characteristics of DiSSCo, which must be preserved throughout the DiSSCo lifetime. Thus, all design decisions (technical, procedural, organisational, etc.) must be assessed for their effect on the protected characteristics. Such decisions and changes must not destroy or lessen the protected characteristics.</p>", 
  "license": "", 
  "creator": [
    {
      "affiliation": "Cardiff University: Cardiff, South Glamorgan, GB", 
      "@id": "", 
      "@type": "Person", 
      "name": "Hardisty, Alex"
    }
  ], 
  "headline": "Provisional Data Management Plan for DiSSCo infrastructure. Deliverable D6.6", 
  "image": "", 
  "datePublished": "2019-11-08", 
  "url": "", 
  "keywords": [
    "data management plan", 
    "digital specimen", 
    "data infrastructure", 
    "data management"
  ], 
  "@context": "", 
  "identifier": "", 
  "@id": "", 
  "@type": "ScholarlyArticle", 
  "name": "Provisional Data Management Plan for DiSSCo infrastructure. Deliverable D6.6"
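
The description above states that timestamped records of change (provenance data) allow a specific version of a digital object to be reconstructed for a date and time in the past. The following is a minimal Python sketch of that idea, not the DiSSCo implementation. It assumes each provenance record carries a timestamp, a field name and a new value; the `ProvenanceRecord` class, the `reconstruct_version` function, the field names and all sample values are hypothetical.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Any


@dataclass(frozen=True)
class ProvenanceRecord:
    """One timestamped change to a digital object (hypothetical structure)."""
    timestamp: datetime
    field: str
    new_value: Any


def reconstruct_version(records, as_of):
    """Replay changes in time order, up to and including `as_of`."""
    state = {}
    for record in sorted(records, key=lambda r: r.timestamp):
        if record.timestamp > as_of:
            break  # later changes are not part of the requested version
        state[record.field] = record.new_value
    return state


# Illustrative history for one Digital Specimen; every value is made up.
history = [
    ProvenanceRecord(datetime(2019, 1, 10), "physicalSpecimenId", "EXAMPLE-0001"),
    ProvenanceRecord(datetime(2019, 1, 10), "scientificName", "Example species A"),
    ProvenanceRecord(datetime(2019, 6, 2), "scientificName", "Example species B"),
]

past = reconstruct_version(history, datetime(2019, 3, 1))
current = reconstruct_version(history, datetime(2019, 12, 31))
```

Here `past` holds the object as it stood before the later name change, while `current` reflects every recorded change. Replaying an append-only log keeps the object mutable without losing earlier states, which is one simple way to satisfy the requirement that every operation generates and preserves provenance.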
                   All versions   This version
Views              911            912
Downloads          212            212
Data volume        359.8 MB       359.8 MB
Unique views       880            881
Unique downloads   190            190


Cite as