Report Open Access

Best Practices for Archival Processing of Research Objects (a librarian view)

Sara Pérez; Oscar Corcho; Raúl Palma; Piotr Hołubowicz

A Research Object is a multidimensional digital object comprising elements that describe how research findings are produced, from the formulation of a hypothesis through the design and execution of experiments and the analysis of results to the drawing of conclusions. The concept of the Research Object is largely motivated by the need to address current challenges in knowledge communication.

These new digital objects have been defined in the context of the European Wf4Ever project (http://www.wf4ever-project.org/), which focuses on providing technological support for their preservation, efficient retrieval, and reuse.

Following the approach developed in GeoMAPP's earlier award-winning work [1], this document presents a suggested workflow for the archival processing of Research Objects. These best practices are for librarians who want to support research preservation with novel techniques and to revolutionize their conventional role as document stewards: they can begin to preserve not only research data but also the methods, together with documentation of how these data and methods led to the outcomes reported in digital publications. In particular, these best practices are intended to be useful for digital library managers and for designers of any scientific workflow infrastructure.

The first part of the document offers a brief introduction to OAIS, which provides the general framework and basis for an archival processing workflow. The second part presents a high-level storage architecture that supports the multi-phase archival processing workflow. The third part, the main focus of the document, offers an RO processing workflow based on the OAIS model for receiving submissions, archiving Research Objects, and making them accessible from archival repositories. For each process, some key questions for an RO infrastructure are presented.

The proposed digital library framework extends the traditional one, used mostly for documents and non-executable digital content, with the notion of dynamic digital objects linked by semantic relations. It supports the evolution of these objects and offers specialized preservation features, such as monitoring workflow decay or other issues that compromise the reproducibility of experiments.

This document provides information at varying levels of detail. We suggest several “trails” through it depending on your interest:

  • If you are already familiar with the notion of ROs, go directly to section 2.
  • If you are interested in the technology needed to store ROs, read section 3.
  • If you are responsible for a library, read mainly sections 1 and 4.
Document for the Wf4ever Knowledge Base. Document identifier: KH/BP-librarians
Files (2.3 MB)
BestPracticesArchivalProcessingROs.pdf (2.3 MB), md5:846e72b911739926339738d6c62fed36

  • Geospatial Multistate Archive and Preservation Partnership (GeoMAPP). Best Practices for Archival Processing for Geospatial Datasets, version 1.0 (final) 02/11/2011.

  • The Digital Library Reference Model (April 2011).

  • Reference Model for an Open Archival Information System (OAIS), Recommended Practice, CCSDS 650.0-M-2 (Magenta Book), Issue 2, June 2012.

  • European Commission. Research Data e-Infrastructures: Framework for Action in Horizon 2020.

  • Producer-Archive Interface Specification (PAIS), Draft Recommended Standard, CCSDS 651.1-R-1 (Red Book), Issue 1, February 2012.

  • Stian Soiland-Reyes; Sean Bechhofer et al.: Wf4Ever Research Object Model. 30 November 2012.

  • W3C Community Open Annotation Data Model (OA)

  • Hettne KM, Wolstencroft K, Belhajjame K, Goble CA, Mina E, Dharuri H, De Roure D, Verdes-Montenegro L, Garrido J, Roos M: Best Practices for Workflow Design: How to Prevent Workflow Decay. In Proceedings of the 5th International Workshop on Semantic Web Applications and Tools for Life Sciences, Paris, France, November 28-30, 2012. Volume 952. Paris, France: CEUR-WS.org; 2012.

  • Minim checklist ontology

  • Gamble M, Goble C, Klyne G, Zhao J: MIM: A Minimum Information Model vocabulary and framework for Scientific Linked Data. IEEE; 2012:1–8.

  • R. González-Cabero, R. Palma, and E. García Cuesta. D3.2v1: Design, implementation and deployment of workflow evolution, sharing and collaboration components. Technical report, Universidad Politécnica de Madrid, July 2012.
