Report Open Access

Evaluation of Erasure Coding and other features of Hadoop 3

Nazerke Seidan

MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="">
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">CERN openlab</subfield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">summer student programme</subfield>
  <controlfield tag="005">20200120173355.0</controlfield>
  <controlfield tag="001">3550780</controlfield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">1884789</subfield>
    <subfield code="z">md5:0898929539159c205f98bd148bf6cbc9</subfield>
    <subfield code="u"></subfield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2019-11-22</subfield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire</subfield>
    <subfield code="p">user-cernopenlab</subfield>
    <subfield code="o"></subfield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="a">Nazerke Seidan</subfield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Evaluation of Erasure Coding and other  features of Hadoop 3</subfield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-cernopenlab</subfield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u"></subfield>
    <subfield code="a">Creative Commons Attribution 4.0 International</subfield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2"></subfield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;Erasure coding, a new feature in HDFS, can reduce storage overhead by approximately 50%&amp;nbsp;&lt;br&gt;
compared to replication while maintaining the same durability guarantees. This would allow to&amp;nbsp;&lt;br&gt;
save a lot of disk capacity in needed by project hosted in CERN IT Hadoop service. The goal of&amp;nbsp;&lt;br&gt;
the project is to evaluate the new features of Hadoop 3 and make an assessment of its readiness&amp;nbsp;&lt;br&gt;
for production systems (this includes installation and configuration of a test hadoop3 cluster,&amp;nbsp;&lt;br&gt;
copying production data to it, conducting multiple performance test on the data). &amp;nbsp;&lt;/p&gt;</subfield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.3550779</subfield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.3550780</subfield>
    <subfield code="2">doi</subfield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">publication</subfield>
    <subfield code="b">report</subfield>
All versions This version
Views 266266
Downloads 632632
Data volume 1.2 GB1.2 GB
Unique views 252252
Unique downloads 597597


Cite as