Published December 2, 2015 | Version v1
Journal article Open

Key components of data publishing: Using current best practices to develop a reference model for data publishing

  • 1. Environment Canada
  • 2. BMJ
  • 3. CERN
  • 4. Nature Publishing Group
  • 5. Maverick
  • 6. Columbia University
  • 7. Woods Hole Oceanographic Institution
  • 8. German Climate Computing Centre (DKRZ)
  • 9. University of Leicester
  • 10. University of Michigan/ICPSR
  • 11. DCC

Description

Additional Contributors:

Tim Clark, Eleni Castro, Elizabeth Newbold, Samuel Moore, Brian Hole

This is the revised version of: 

Bloom, Theodora et al. (2015). Workflows for Research Data Publishing: Models and Key Components (Submitted Version). Zenodo. 10.5281/zenodo.20308

Abstract

Purpose:

Availability of workflows for data publishing could have an enormous impact on researchers, research practices and publishing paradigms, as well as on funding strategies and career and research evaluations. We present the generic components of such workflows in order to provide a reference model for these stakeholders.

Methods:

The RDA-WDS Data Publishing Workflows group set out to study the current data publishing workflow landscape across disciplines and institutions. A diverse set of workflows, including basic self-publishing services, institutional data repositories, long-term projects, curated data repositories, and joint data journal and repository arrangements, was examined to identify common components and standard practices.

Results:

The results of this examination have been used to derive a data publishing reference model composed of generic components. From an assessment of the current data publishing landscape, we highlight important gaps and challenges to consider, especially when dealing with more complex workflows and their integration into wider community frameworks.

Conclusions:

It is clear that the data publishing landscape is varied and dynamic, and that there are important gaps and challenges. The different components of a data publishing system need to work, to the greatest extent possible, in a seamless and integrated way. We therefore advocate the implementation of existing standards for repositories and all parts of the data publishing process, and the development of new standards where necessary. Effective and trustworthy data publishing should be embedded in documented workflows. As more research communities seek to publish the data associated with their research, they can build on one or more of the components identified in this reference model. 
