Report Open Access

Evaluation of Erasure Coding and other features of Hadoop 3

Nazerke Seidan

Erasure coding, a new feature in HDFS, can reduce storage overhead by approximately 50% 
compared to replication while maintaining the same durability guarantees. This could save a 
substantial amount of the disk capacity needed by projects hosted in the CERN IT Hadoop service. 
The goal of the project is to evaluate the new features of Hadoop 3 and assess its readiness 
for production systems. This includes installing and configuring a Hadoop 3 test cluster, 
copying production data to it, and running multiple performance tests on that data.
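
As a rough illustration of where a figure of about 50% can come from, the sketch below compares raw disk usage under two schemes. It assumes 3-way replication and a Reed-Solomon layout with 6 data blocks and 3 parity blocks; both are common HDFS defaults, but neither is stated in the abstract, so treat them as assumptions. The function name `raw_storage_factor` is hypothetical and used only for this example.

```python
def raw_storage_factor(data_units: int, redundancy_units: int) -> float:
    """Return raw bytes written to disk per byte of user data."""
    return (data_units + redundancy_units) / data_units


# Assumption: 3-way replication, i.e. 1 original copy plus 2 extra copies.
replication = raw_storage_factor(data_units=1, redundancy_units=2)

# Assumption: Reed-Solomon with 6 data blocks and 3 parity blocks.
erasure_coding = raw_storage_factor(data_units=6, redundancy_units=3)

# Fraction of raw disk space saved by moving from replication to erasure coding.
saving = 1 - erasure_coding / replication

print(f"replication:    {replication:.1f}x raw storage")
print(f"erasure coding: {erasure_coding:.1f}x raw storage")
print(f"saving:         {saving:.0%}")
```

Under these assumptions, replication stores three bytes for every byte of user data and erasure coding stores one and a half, which halves the raw disk usage. Other block layouts would give different ratios.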
