UPDATE: Zenodo migration postponed to Oct 13 from 06:00-08:00 UTC. Read the announcement.

Project deliverable Open Access

FAIR-EASE_D4.2_Landscaping exercise_The inclusion of special use case datasets in the data lake

Nydia Catalina Reyes Suarez; Mark Portier; Alessandra Giorgetti; Reiner Schlitzer; Giuliano Langella; Marie Boichu; Vincent Breton; Virgine Racapé; Cymon J. Cox

Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
  <dc:creator>Nydia Catalina Reyes Suarez</dc:creator>
  <dc:creator>Mark Portier</dc:creator>
  <dc:creator>Alessandra Giorgetti</dc:creator>
  <dc:creator>Reiner Schlitzer</dc:creator>
  <dc:creator>Giuliano Langella</dc:creator>
  <dc:creator>Marie Boichu</dc:creator>
  <dc:creator>Vincent Breton</dc:creator>
  <dc:creator>Virgine Racapé</dc:creator>
  <dc:creator>Cymon J. Cox</dc:creator>
  <dc:description>This document describes the landscaping exercise proposed for deliverable 4.2 (D4.2) within Work Package (WP) 4 of the FAIR Earth Sciences &amp; Environment services project (FAIR-EASE, FE). The goal of this exercise is to analyse different special use case (UC) datasets per pilot and the requirements they must meet to be included in the data lake infrastructure proposed in D4.1 (landscaping exercise: the (meta)data, software, and cloud needs for the data lake). The pilots per UC are:

	UC1 - Earth and Environmental Dynamics: Coastal waters dynamics (Pilot 5.1.1), Earth Critical zones observatory (Pilot 5.1.2), and Volcano Space Observatory (Pilot 5.1.3),
	UC2 - Environmental Bio-geochemical Assets: Ocean Bio-Geo-Chemical Observatory (Pilot 5.2.1) and,
	UC3 - Biodiversity Observation: Marine Omics Observatory (Pilot 5.3.1).

Datasets from each pilot were selected from Table 1 in Annex A of D5.1 (report on key requirements from Use Cases and Pilots, [1]) and with a prior selection from a representative from each pilot. These datasets were selected to cover as much diversity as possible and to reflect the multidisciplinary nature of each UC. The deliverable aims to analyse and highlight the criticalities of the selected datasets considering their current limitations and needs and how they could fit into the “data provider” view proposed for the “data lakes” architecture in D4.1.

Bear in mind that the datasets described should not be taken as the only source for the data lake ingestion. As stated, before a few special UCs datasets were selected using the minimum selection and maximum diversity criteria, to analyse the requirements for them to be ingested in the data lakes proposed.</dc:description>
  <dc:title>FAIR-EASE_D4.2_Landscaping exercise_The inclusion of special use case datasets in the data lake</dc:title>
All versions This version
Views 8080
Downloads 6767
Data volume 498.1 MB498.1 MB
Unique views 7474
Unique downloads 6262


Cite as