Dynamic Node Management and Measure Estimation in a State-Driven Fault Injector

14 years 5 months ago

Download suif.stanford.edu

Validation of distributed systems using fault injection is difﬁcult because of their inherent complexity, lack of a global clock, and lack of an easily accessible notion of a global state. To address these challenges, the Loki fault injector injects faults based on a partial view of the global state of a distributed system, and performs a post-runtime analysis using an off-line clock synchronization algorithm to determine whether the faults were properly injected. In this paper, we ﬁrst describe an enhanced runtime architecture for the Loki fault injector and then present a new method for obtaining measures in Loki. The enhanced runtime allows dynamic entry and exit of nodes in the system. It also offers more efﬁcient multicast of notiﬁcation messages and more efﬁcient communication between state machines on the same host, and is more scalable than the previous runtime. We then detail a new and ﬂexible method for obtaining a wide range of performance and dependability meas...

Ramesh Chandra, Michel Cukier, Ryan M. Lefever, Wi

Real-time Traffic

Distributed Systems | Fault Injection | Loki Fault Injector | Operating Systems | SRDS 2000 |

claim paper

Post Info
More Details (n/a)

Added	01 Aug 2010
Updated	01 Aug 2010
Type	Conference
Year	2000
Where	SRDS
Authors	Ramesh Chandra, Michel Cukier, Ryan M. Lefever, William H. Sanders

Comments (0)

Sciweavers

Dynamic Node Management and Measure Estimation in a State-Driven Fault Injector

Distributed Systems | Fault Injection | Loki Fault Injector | Operating Systems | SRDS 2000 |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers