Conference paper Open Access

Chrysaor: Fine-Grained, Fault-Tolerant Cloud-of-Clouds MapReduce

Costa, Pedro A. R. S.; Ramos, Fernando M. V.; Correia, Miguel

DataCite XML Export

<?xml version='1.0' encoding='utf-8'?>
<resource>
<identifier identifierType="DOI">10.5281/zenodo.814856</identifier>
<creators>
<creator>
<creatorName>Costa, Pedro A. R. S.</creatorName>
<givenName>Pedro A. R. S.</givenName>
<familyName>Costa</familyName>
</creator>
<creator>
<creatorName>Ramos, Fernando M. V.</creatorName>
<givenName>Fernando M. V.</givenName>
<familyName>Ramos</familyName>
</creator>
<creator>
<creatorName>Correia, Miguel</creatorName>
<givenName>Miguel</givenName>
<familyName>Correia</familyName>
</creator>
</creators>
<titles>
<title>Chrysaor: Fine-Grained, Fault-Tolerant Cloud-of-Clouds MapReduce</title>
</titles>
<publisher>Zenodo</publisher>
<publicationYear>2017</publicationYear>
<dates>
<date dateType="Issued">2017-05-14</date>
</dates>
<resourceType resourceTypeGeneral="Text">Conference paper</resourceType>
<alternateIdentifiers>
<alternateIdentifier alternateIdentifierType="url">https://zenodo.org/record/814856</alternateIdentifier>
</alternateIdentifiers>
<relatedIdentifiers>
<relatedIdentifier relatedIdentifierType="DOI" relationType="IsSupplementedBy">10.5281/zenodo.897490</relatedIdentifier>
<relatedIdentifier relatedIdentifierType="DOI" relationType="IsVersionOf">10.5281/zenodo.814855</relatedIdentifier>
<relatedIdentifier relatedIdentifierType="URL" relationType="IsPartOf">https://zenodo.org/communities/supercloud</relatedIdentifier>
</relatedIdentifiers>
<rightsList>
<rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights>
</rightsList>
<descriptions>
<description descriptionType="Abstract">&lt;p&gt;MapReduce is a framework for processing large data sets that is widely used in cloud computing. MapReduce implementations like Hadoop can tolerate crashes and file corruptions, but not arbitrary faults. Unfortunately, there is evidence that arbitrary faults do occur and can affect the correctness of MapReduce job executions. Furthermore, many outages of major cloud offerings have been reported, raising concerns about dependence on a single cloud. In this paper we propose a novel execution system that scales out MapReduce computations to a cloud-of-clouds and tolerates arbitrary faults, malicious faults, and cloud outages. Our system, Chrysaor, is based on a fine-grained replication scheme that tolerates faults at the task level. Our solution has three important properties: it tolerates the above-mentioned classes of faults at reasonable cost; it requires minimal modifications to the users’ applications; and it does not involve changes to the Hadoop source code. We performed an extensive evaluation of our system in Amazon EC2, showing that our fine-grained solution is efficient in terms of computation because it recovers only faulty tasks. This is achieved without incurring a significant penalty for the baseline case (i.e., without faults) in most workloads.&lt;/p&gt;</description>
</descriptions>
<fundingReferences>
<fundingReference>
<funderName>European Commission</funderName>
<funderIdentifier funderIdentifierType="Crossref Funder ID">10.13039/501100000780</funderIdentifier>
<awardNumber awardURI="info:eu-repo/grantAgreement/EC/H2020/643964/">643964</awardNumber>
<awardTitle>USER-CENTRIC MANAGEMENT OF SECURITY AND DEPENDABILITY IN CLOUDS OF CLOUDS</awardTitle>
</fundingReference>
</fundingReferences>
</resource>
