Toggle Main Menu Toggle Search

Open Access padlockePrints

Reliable Computing Systems

Lookup NU author(s): Professor Brian RandellORCiD, Professor Pete Lee

Downloads


Abstract

The paper presents an analysis of the various problems involved in acheiving very high reliablility from complex computing systems, and discusses the relationship between system structuring techniques and techniques of fault tolerance. Topics covered include (i) differeing types of reliability requirement, (ii) forms of protective redundancy in hardware and software systems, (iii) methods of structuring the activity of a system, using atomic actions, so as to limit information flow, (iv) error detection techniques, (v) strategies for locating and dealing with faults, and for assessing the damage they have caused, and (vi) forward and backward error recovery techniques, based on the concepts of recovery line, commitment, exception and compensation. A set of appendices provide summary descriptions and analyses of a number of computing systems that have been specifically designed with the aim of acheiving very high reliablility.


Publication metadata

Author(s): Randell B, Lee PA, Treleavan PC

Series Editor(s): Shaw B

Publication type: Report

Publication status: Published

Series Title: Computing Laboratory Technical Report Series

Year: 1977

Pages: [118]

Print publication date: 01/05/1977

Source Publication Date: May 1977

Report Number: 102

Institution: Computing Laboratory, The University of Newcastle upon Tyne

Place Published: Newcastle upon Tyne


Share