Published September 17, 2019 | Version v1
Conference paper Open

Dealing with Hardware Faults in Energy-Efficient Static Schedules of Multi-Variant Programs on Heterogeneous Platforms

  • 1. Dept. of Computer and Information Science (IDA) Linköping University
  • 2. Faculty of Mathematics and Computer Science Fern Universität in Hagen

Description

We investigate the energy-efficient execution of programs with a sequence of program parts, each part executable by multiple variants on different execution units. We study their behaviour under the presence of crash faults on a computing platform with heterogeneous execution units like multicore, GPU, and FPGA. To this end, we extend a static scheduling algorithm for computing the sequence of variants leading to minimum runtime, minimum energy consumption, or a weighted sum of both, to consider cases where one or more program variants cannot be used anymore from some execution point on, due to failure of the underlying execution unit(s). This extension combines the advantageous results of static scheduling, known in the fault-free case, with avoidance of overhead for re-scheduling in case of a fault. We evaluate our algorithm with synthetically generated progam task graphs. The results indicate that, compared to computing a new schedule for each fault case, our algorithm only needs 55% of the scheduling time for 8 variants.

Notes

https://ieeexplore.ieee.org/document/8836192 Published in: ARCS Workshop 2019; 32nd International Conference on Architecture of Computing Systems

Files

2_dealing_with.pdf

Files (202.5 kB)

Name Size Download all
md5:e96f4c5db5ac0db2113b437728229d6e
202.5 kB Preview Download

Additional details

Funding

EXA2PRO – Enhancing Programmability and boosting Performance Portability for Exascale Computing Systems 801015
European Commission