Sciweavers

2400 search results - page 89 / 480
» Systems Failures
Sort
View
SSS
2009
Springer
118views Control Systems» more  SSS 2009»
14 years 1 months ago
Brief Announcement: A Simple and Quiescent Omega Algorithm in the Crash-Recovery Model
We present a simple algorithm that implements the Omega failure detector in the crash-recovery model. The algorithm is quiescent, i.e., eventually all the processes but the leader ...
Cristian Martín, Mikel Larrea
SIGOPS
2010
91views more  SIGOPS 2010»
13 years 7 months ago
Why panic()?: improving reliability with restartable file systems
The file system is one of the most critical components of the operating system. Almost all applications running in the operating system require file systems to be available for ...
Swaminathan Sundararaman, Sriram Subramanian, Abhi...
SSS
2009
Springer
14 years 3 months ago
Stability of Distributed Algorithms in the Face of Incessant Faults
Abstract. For large distributed systems built from inexpensive components, one expects to see incessant failures. This paper proposes two models for such faults and analyzes two we...
Robert E. Lee DeVille, Sayan Mitra
ICWS
2009
IEEE
14 years 6 months ago
Scalable and Reliable Location Services through Decentralized Replication
—One of the critical challenges for service oriented computing systems is the capability to guarantee scalable and reliable service provision. This paper presents Reliable GeoGri...
Gong Zhang, Ling Liu, Sangeetha Seshadri, Bhuvan B...
GCC
2007
Springer
14 years 3 months ago
Spaces: Support for Decoupled Communication in Wide-Area Parallel Applications
Wide-area distributed systems like computational grids are emergent infrastructures for high-performance parallel applications. On these systems, communication mechanisms have to ...
Philip Chan, David Abramson