Fault Injection based on a Partial View of the Global State of a Distributed System

14 years 4 months ago

Download suif.stanford.edu

Validating distributed systems is particularly difficult, since failures may occur due to a correlated occurrence of faults in different parts of the system. This paper describes the basis for and preliminary implementation of a new fault injector, called Loki, developed specifically for distributed systems. Loki addresses issues related to injecting correlated faults in distributed systems. In Loki, fault injection is performed based on a partial view of the global state of an application. In particular, facilities are provided to pass user-specified state information between nodes to provide a partial view of the global state in order to try to inject complex faults successfully. A post-runtime analysis, done using an off-line clock synchronization and a bounding technique, is used to place events and injections on a single global timeline and determine whether the intended faults were properly injected. Finally, observations containing successful fault injections are used to estima...

Michel Cukier, Ramesh Chandra, David Henke, Jessic

Real-time Traffic

Complex Faults | Distributed Systems | Faults | Operating Systems | SRDS 1999 |

claim paper

Post Info
More Details (n/a)

Added	04 Aug 2010
Updated	04 Aug 2010
Type	Conference
Year	1999
Where	SRDS
Authors	Michel Cukier, Ramesh Chandra, David Henke, Jessica Pistole, William H. Sanders

Comments (0)

Sciweavers

Fault Injection based on a Partial View of the Global State of a Distributed System

Complex Faults | Distributed Systems | Faults | Operating Systems | SRDS 1999 |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers