Lookup NU author(s): Dr Paul Ezhilchelvan
This thesis describes the design and development of algorithms for fault-tolerant distributed systems. The development of such algorithms requires making assumptions about the types of component faults for which tolerance is provided. Such assumptions must be specified accurately. To this end, this thesis develops a classification of faults in systems. This fault classification identifies a range of fault types from the most restricted to the least restricted. For each fault type, an algorithm for reaching distributed agreement in the presence of a bounded number of faulty processors is developed, and thus a family of agreement algorithms is presented. The influence of the various fault types on the complexities of these algorithms is discussed. Early stopping algorithms are also developed for selected fault types, and the influence of fault types on the early stopping conditions of the respective algorithms is analysed. The problem of evaluating the performance of distributed replicated systems that will require agreement algorithms is considered next. As a first step towards meeting this challenging task, a pipeline triple modular redundant system is considered and analytical methods are derived to evaluate the performance of such a system. Finally, the accuracy of these methods is examined using computer simulations.
Author(s): Ezhilchelvan PD
Publication type: Report
Publication status: Published
Series Title: Computing Laboratory
Print publication date: 01/01/1989
Source Publication Date: 1989
Institution: University of Newcastle upon Tyne
Place Published: Newcastle upon Tyne
Notes: British Lending Library DSC stock location number: DX88807