Sciweavers

483 search results - page 2 / 97
» Fault Management in P2P-MPI
Sort
View
CLUSTER
2002
IEEE
14 years 12 days ago
Design and Validation of Portable Communication Infrastructure for Fault-Tolerant Cluster Middleware
We describe the communication infrastructure (CI) for our fault-tolerant cluster middleware, which is optimized for two classes of communication: for the applications and for the ...
Ming Li, Wenchao Tao, Daniel Goldberg, Israel Hsu,...
NOMS
2008
IEEE
14 years 1 months ago
Distributed fault correlation scheme using a semantic publish/subscribe system
—Increasingly there is a demand for more scalable fault management schemes to cope with the ever increasing growth and complexity of modern networks. Current distributed fault co...
Wei Tai, Declan O'Sullivan, John Keeney
IM
2003
13 years 8 months ago
Toward Understanding Soft Faults in High Performance Cluster Networks
: Fault management in high performance cluster networks has been focused on the notion of hard faults (i.e., link or node failures). Network degradations that negatively impact per...
Jeffrey J. Evans, Seongbok Baik, Cynthia S. Hood, ...
SRDS
2000
IEEE
13 years 11 months ago
Dynamic Node Management and Measure Estimation in a State-Driven Fault Injector
Validation of distributed systems using fault injection is difficult because of their inherent complexity, lack of a global clock, and lack of an easily accessible notion of a gl...
Ramesh Chandra, Michel Cukier, Ryan M. Lefever, Wi...
ISSRE
2008
IEEE
14 years 1 months ago
The Effect of the Number of Defects on Estimates Produced by Capture-Recapture Models
Project managers use inspection data as input to capture-recapture (CR) models to estimate the total number of faults present in a software artifact. The CR models use the number ...
Gursimran Singh Walia, Jeffrey C. Carver