Sciweavers

231 search results - page 26 / 47
» Asynchronous failure detectors
ICDCS
2005
IEEE
14 years 2 months ago
The Impossibility of Boosting Distributed Service Resilience
We prove two theorems saying that no distributed system in which processes coordinate using reliable registers and f-resilient services can solve the consensus problem in the prese...
Paul C. Attie, Rachid Guerraoui, Petr Kouznetsov, ...
IPPS
2009
IEEE
14 years 3 months ago
Compiler-enhanced incremental checkpointing for OpenMP applications
As modern supercomputing systems reach the peta-flop performance range, they grow in both size and complexity. This makes them increasingly vulnerable to failures from a variety ...
Greg Bronevetsky, Daniel Marques, Keshav Pingali, ...
GCC
2007
Springer
14 years 2 months ago
Spaces: Support for Decoupled Communication in Wide-Area Parallel Applications
Wide-area distributed systems like computational grids are emergent infrastructures for high-performance parallel applications. On these systems, communication mechanisms have to ...
Philip Chan, David Abramson
LCPC
2007
Springer
14 years 2 months ago
Compiler-Enhanced Incremental Checkpointing
As modern supercomputing systems reach the peta-flop performance range, they grow in both size and complexity. This makes them increasingly vulnerable to failures from a variety o...
Greg Bronevetsky, Daniel Marques, Keshav Pingali, ...
PDP
2006
IEEE
14 years 2 months ago
A B2B Distributed Replication Service
A deadlock-free distributed replication service for B2B CORBA-based applications is presented. This service provides persistent storage for commercial transactions performed by B2...
José Javier Astrain, Alberto Córdoba...