Sciweavers

483 search results - page 26 / 97
» Fault Management in P2P-MPI
Sort
View
TDSC
2010
146views more  TDSC 2010»
13 years 3 months ago
Fault Localization via Risk Modeling
Automated, rapid, and effective fault management is a central goal of large operational IP networks. Today's networks suffer from a wide and volatile set of failure modes, wh...
Ramana Rao Kompella, Jennifer Yates, Albert G. Gre...
AIMS
2010
Springer
14 years 14 days ago
Probabilistic Fault Diagnosis in the MAGNETO Autonomic Control Loop
Management of outer edge domains is a big challenge for service providers due to the diversity, heterogeneity and large amount of such networks, together with limited visibility on...
Pablo Arozarena, Raquel Toribio, Jesse Kielthy, Ke...
SC
2000
ACM
14 years 29 days ago
Scalable Fault-Tolerant Distributed Shared Memory
This paper shows how a state-of-the-art software distributed shared-memory (DSM) protocol can be efficiently extended to tolerate single-node failures. In particular, we extend a ...
Florin Sultan, Thu D. Nguyen, Liviu Iftode
SOSP
2001
ACM
14 years 5 months ago
BASE: Using Abstraction to Improve Fault Tolerance
ing Abstraction to Improve Fault Tolerance MIGUEL CASTRO Microsoft Research and RODRIGO RODRIGUES and BARBARA LISKOV MIT Laboratory for Computer Science Software errors are a major...
Rodrigo Rodrigues, Miguel Castro, Barbara Liskov
MINENET
2005
ACM
14 years 2 months ago
Shrink: a tool for failure diagnosis in IP networks
Faults in an IP network have various causes such as the failure of one or more routers at the IP layer, fiber-cuts, failure of physical elements at the optical layer, or extraneo...
Srikanth Kandula, Dina Katabi, Jean-Philippe Vasse...