Published November 22, 2019 | Version v1
Report Open

Evaluation of Erasure Coding and other features of Hadoop 3

Creators

Description

Erasure coding, a new feature in HDFS, can reduce storage overhead by approximately 50% 
compared to replication while maintaining the same durability guarantees. This would allow to 
save a lot of disk capacity in needed by project hosted in CERN IT Hadoop service. The goal of 
the project is to evaluate the new features of Hadoop 3 and make an assessment of its readiness 
for production systems (this includes installation and configuration of a test hadoop3 cluster, 
copying production data to it, conducting multiple performance test on the data).  

Files

Report_Nazerke_Seidan.pdf

Files (1.9 MB)

Name Size Download all
md5:0898929539159c205f98bd148bf6cbc9
1.9 MB Preview Download