Toggle Main Menu Toggle Search

Open Access padlockePrints

Analysis of restart mechanisms in software systems

Lookup NU author(s): Professor Aad van Moorsel, Dr Katinka Wolter

Downloads


Abstract

Restarts or retries are a common phenomenon in computing systems, for instance, in preventive maintenance, software rejuvenation, or when a failure is suspected. Typically, one sets a time-out to trigger the restart. We analyze and optimize time-out strategies for scenarios in which the expected required remaining time of a task is not always decreasing with the time invested in it. Examples of such tasks include the download of Web pages, randomized algorithms, distributed queries, and jobs subject to network or other failures. Assuming the independence of the completion time of successive tries, we derive computationally attractive expressions for the moments of the completion time, as well as for the probability that a task is able to meet a deadline. These expressions facilitate efficient algorithms to compute optimal restart strategies and are promising candidates for pragmatic online optimization of restart timers. © 2006 IEEE.


Publication metadata

Author(s): van Moorsel A, Wolter K

Publication type: Article

Publication status: Published

Journal: IEEE Transactions on Software Engineering

Year: 2006

Volume: 32

Issue: 8

Pages: 547-558

Date deposited: 07/07/2010

ISSN (print): 0098-5589

ISSN (electronic): 1939-3520

Publisher: IEEE

URL: http://dx.doi.org/10.1109/TSE.2006.73

DOI: 10.1109/TSE.2006.73


Altmetrics

Altmetrics provided by Altmetric


Share