Conference paper Open Access

Optimum Checkpointing for Long-running Programs

Siavvas, Miltiadis; Gelenbe, Erol

Checkpoints are widely used to improve the performance of computer systems and programs in the presence of failures, and they significantly reduce the cost of restarting a program each time it fails. Application-level checkpointing has been proposed for programs that may execute on failure-prone platforms, and also to reduce the execution time of programs that are prone to internal failures. We propose a mathematical model to estimate the average execution time of a program that operates in the presence of dependability failures, with and without application-level checkpointing, and use it to estimate the optimum interval, in number of instructions executed, between successive checkpoints. Particular emphasis is placed on programs with loops, and the results are illustrated through simulation.
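The trade-off the abstract describes can be illustrated with a minimal sketch. It is not the model developed in the paper; it is a textbook-style simplification under assumptions made here for illustration: every instruction takes one time unit and fails independently with probability `p`, a failure rolls execution back to the most recent checkpoint (or to the start if there is none), each checkpoint costs a fixed `c` time units and never fails, and an optional `recovery` cost is paid per failure. All function and parameter names are hypothetical.

```python
def expected_segment_time(n, p, recovery=0.0):
    """Expected time to run n instructions in a row without a failure.

    Assumption: each instruction costs 1 time unit and fails with
    probability p; a failure wastes the time spent so far in the
    segment and the segment restarts from its beginning.
    """
    if p == 0:
        return float(n)
    q = 1.0 - p
    # Expected number of failed attempts before one clean pass of n steps.
    failures = q ** (-n) - 1.0
    # failures / p is the classic expected number of trials needed to
    # observe n consecutive successes; recovery is paid once per failure.
    return failures / p + recovery * failures


def expected_time_with_checkpoints(total, interval, p, c, recovery=0.0):
    """Expected run time when a checkpoint is taken every `interval` instructions.

    Assumption: no checkpoint is taken after the final segment.
    """
    full, rest = divmod(total, interval)
    time = full * expected_segment_time(interval, p, recovery)
    if rest:
        time += expected_segment_time(rest, p, recovery)
    segments = full + (1 if rest else 0)
    return time + c * (segments - 1)


def best_interval(total, p, c, recovery=0.0):
    """Brute-force search for the interval with the lowest expected time."""
    return min(
        range(1, total + 1),
        key=lambda k: expected_time_with_checkpoints(total, k, p, c, recovery),
    )


if __name__ == "__main__":
    total, p, c = 10_000, 1e-3, 5.0  # illustrative values only
    k = best_interval(total, p, c)
    print("no checkpoints:", expected_segment_time(total, p))
    print("best interval:", k)
    print("with checkpoints:", expected_time_with_checkpoints(total, k, p, c))
```

Under these assumptions, checkpointing too often wastes time on checkpoint overhead, while checkpointing too rarely lets each failure discard a large amount of work, so the expected time has a minimum at some intermediate interval. The exhaustive search is used here only for simplicity.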

Files (669.9 kB)
CEISEE_2019____Checkpointing_Paper.pdf, 669.9 kB, md5:694bbec0ae23ec117062135057f5feca
