Sciweavers

» Systems Failures
OSDI
2000
ACM
13 years 9 months ago
Exploring Failure Transparency and the Limits of Generic Recovery
We explore the abstraction of failure transparency in which the operating system provides the illusion of failure-free operation. To provide failure transparency, an operating sy...
David E. Lowell, Subhachandra Chandra, Peter M. Ch...
NOMS
2008
IEEE
14 years 2 months ago
DYSWIS: An architecture for automated diagnosis of networks
As the complexity of networked systems increases, we need mechanisms to automatically detect failures in the network and diagnose the cause of such failures. To realize true self-...
Vishal Kumar Singh, Henning Schulzrinne, Kai Miao
3PGCIC
2010
13 years 5 months ago
Using a Failure History Service for Reliable Grid Node Information
The need for reliability in Grid Systems is a difficult challenge which is very important in the context of highly dynamic systems composed of thousands of nodes. Failure manageme...
Catalin Leordeanu, Valentin Cristea, Thomas Ropars...
SASO
2007
IEEE
14 years 1 months ago
Root Cause Isolation for Self Healing in J2EE Environments
The increasing complexity of distributed enterprise systems has made the task of managing these systems difficult and time consuming. The only way to simplify the management p...
Umesh Bellur, Amar Agrawal
PDP
2002
IEEE
14 years 16 days ago
Eventually Consistent Failure Detectors
The concept of unreliable failure detector was introduced by Chandra and Toueg as a mechanism that provides information about process failures. This mechanism has been used to sol...
Mikel Larrea, Antonio Fernández, Sergio Ar...