Report Open Access
Nazerke Seidan
Erasure coding, a new feature in HDFS, can reduce storage overhead by approximately 50%
compared to replication while maintaining the same durability guarantees. This would allow to
save a lot of disk capacity in needed by project hosted in CERN IT Hadoop service. The goal of
the project is to evaluate the new features of Hadoop 3 and make an assessment of its readiness
for production systems (this includes installation and configuration of a test hadoop3 cluster,
copying production data to it, conducting multiple performance test on the data).
Name | Size | |
---|---|---|
Report_Nazerke_Seidan.pdf
md5:0898929539159c205f98bd148bf6cbc9 |
1.9 MB | Download |
All versions | This version | |
---|---|---|
Views | 264 | 264 |
Downloads | 627 | 627 |
Data volume | 1.2 GB | 1.2 GB |
Unique views | 250 | 250 |
Unique downloads | 592 | 592 |