There is a newer version of the record available.

Published February 16, 2017 | Version v1
Proposal Open

The Federated Scientific Data Hub

Creators

  • 1. CERN

Description

The paper proposes to establish a federated scientific data hub to deliver trusted data-intensive services in a federated environment where data complies with the “FAIR” principles (Findable, Accessible, Interoperable, Re-usable). It will leverage existing services across Member States and disciplinary, social and geographical borders.

The data hub will ensure the preservation of scientific data, together with the software, documentation and computing environment needed to process, (re-)analyse or otherwise (re-)use the data for the most advanced user communities that are willing to engage and contribute. Target data volumes in the exabyte range are envisaged with data management plans spanning at least one decade.

The data hub will offer production quality data-intensive services drawn from both public and private sources and presented through a common service catalogue. The services must conform to a minimum set of technical, legal and security requirements that ensure their suitability for inclusion. An initial set of interoperable services will include data management, compute (including HPC) and ‘Human’ services such as consultancy, s/w development, training etc.   A combined service catalogue, service credit scheme and pay-for-usage model will make it easy to grow the data hub to include new funding streams, users and service providers.

The funding model will mobilise public and private sector funding with a mix of capital and operational budgets, involvement of commercial service providers as well as public-sector e-infrastructure operators and a broad spectrum of stakeholders. Separate models for capital investment, operational charges and charging structures for scholarly publishing will be combined to cover the whole lifecycle of the platform. Funding schemes that vary from country to country and between communities need to be carefully assessed and reflected in the selected models.

A multi-stakeholder governance model will be established to collaborate with and not replace the existing governance models for research infrastructures, scientific user communities as well as the e-infrastructure projects and initiatives that contribute services.

Files

FedSciDataHub.pdf

Files (240.2 kB)

Name Size Download all
md5:2c3ed4c12ed61584ce6b01ca58d06738
240.2 kB Preview Download