UPDATE: Zenodo migration postponed to Oct 13 from 06:00-08:00 UTC. Read the announcement.

Project deliverable Open Access

FAIR-EASE_D4.2_Landscaping exercise_The inclusion of special use case datasets in the data lake

Nydia Catalina Reyes Suarez; Mark Portier; Alessandra Giorgetti; Reiner Schlitzer; Giuliano Langella; Marie Boichu; Vincent Breton; Virgine Racapé; Cymon J. Cox


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nam##2200000uu#4500</leader>
  <datafield tag="041" ind1=" " ind2=" ">
    <subfield code="a">eng</subfield>
  </datafield>
  <controlfield tag="005">20230523022650.0</controlfield>
  <controlfield tag="001">7957747</controlfield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">VLIZ</subfield>
    <subfield code="0">(orcid)0000-0002-9648-6484</subfield>
    <subfield code="a">Mark Portier</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">OGS</subfield>
    <subfield code="0">(orcid)0000-0002-0914-4831</subfield>
    <subfield code="a">Alessandra Giorgetti</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">AWI</subfield>
    <subfield code="0">(orcid)0000-0002-3740-6499</subfield>
    <subfield code="a">Reiner Schlitzer</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">UNINA</subfield>
    <subfield code="0">(orcid)0000-0001-7210-0906</subfield>
    <subfield code="a">Giuliano Langella</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">ULille</subfield>
    <subfield code="0">(orcid)0000-0003-3163-8325</subfield>
    <subfield code="a">Marie Boichu</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">CNRS</subfield>
    <subfield code="0">(orcid)0000-0001-8197-7080</subfield>
    <subfield code="a">Vincent Breton</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">POKAPOK</subfield>
    <subfield code="0">(orcid)0000-0003-0239-5125</subfield>
    <subfield code="a">Virgine Racapé</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">CCMAR</subfield>
    <subfield code="0">(orcid)0000-0002-4927-979X</subfield>
    <subfield code="a">Cymon J. Cox</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">7434516</subfield>
    <subfield code="z">md5:5c39736545fe74183039af8973fd55bf</subfield>
    <subfield code="u">https://zenodo.org/record/7957747/files/FAIR-EASE_D4.2_Landscaping exercise_The inclusion of special use case datasets in the data lake.pdf</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2023-05-05</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire</subfield>
    <subfield code="p">user-eosc_fairease</subfield>
    <subfield code="o">oai:zenodo.org:7957747</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">OGS</subfield>
    <subfield code="0">(orcid)0000-0002-3906-471X</subfield>
    <subfield code="a">Nydia Catalina Reyes Suarez</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">FAIR-EASE_D4.2_Landscaping exercise_The inclusion of special use case datasets in the data lake</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-eosc_fairease</subfield>
  </datafield>
  <datafield tag="536" ind1=" " ind2=" ">
    <subfield code="c">101058785</subfield>
    <subfield code="a">FAIR EArth Sciences &amp; Environment services</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">https://creativecommons.org/licenses/by/4.0/legalcode</subfield>
    <subfield code="a">Creative Commons Attribution 4.0 International</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;This document describes the landscaping exercise proposed for deliverable 4.2 (D4.2) within Work Package (WP) 4 of the FAIR Earth Sciences &amp;amp; Environment services project (FAIR-EASE, FE). The goal of this exercise is to analyse different special use case (UC) datasets per pilot and the requirements they must meet to be included in the data lake infrastructure proposed in D4.1 (landscaping exercise: the (meta)data, software, and cloud needs for the data lake). The pilots per UC are:&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;UC1 - Earth and Environmental Dynamics: Coastal waters dynamics (Pilot 5.1.1), Earth Critical zones observatory (Pilot 5.1.2), and Volcano Space Observatory (Pilot 5.1.3),&lt;/li&gt;
	&lt;li&gt;UC2 - Environmental Bio-geochemical Assets: Ocean Bio-Geo-Chemical Observatory (Pilot 5.2.1) and,&lt;/li&gt;
	&lt;li&gt;UC3 - Biodiversity Observation: Marine Omics Observatory (Pilot 5.3.1).&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Datasets from each pilot were selected from Table 1 in Annex A of D5.1 (report on key requirements from Use Cases and Pilots, [1]) and with a prior selection from a representative from each pilot. These datasets were selected to cover as much diversity as possible and to reflect the multidisciplinary nature of each UC. The deliverable aims to analyse and highlight the criticalities of the selected datasets considering their current limitations and needs and how they could fit into the &amp;ldquo;data provider&amp;rdquo; view proposed for the &amp;ldquo;data lakes&amp;rdquo; architecture in D4.1.&lt;/p&gt;

&lt;p&gt;Bear in mind that the datasets described should not be taken as the only source for the data lake ingestion. As stated, before a few special UCs datasets were selected using the minimum selection and maximum diversity criteria, to analyse the requirements for them to be ingested in the data lakes proposed.&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.7957746</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.7957747</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">publication</subfield>
    <subfield code="b">deliverable</subfield>
  </datafield>
</record>
80
67
views
downloads
All versions This version
Views 8080
Downloads 6767
Data volume 498.1 MB498.1 MB
Unique views 7474
Unique downloads 6262

Share

Cite as