Sciweavers

464 search results - page 53 / 93
» A Fault Tolerance Protocol with Fast Fault Recovery
Sort
View
SRDS
1998
IEEE
13 years 12 months ago
System-Level Versus User-Defined Checkpointing
Checkpointing and rollback recovery is a very effective technique to tolerate transient faults and preventive shutdowns. In the past, most of the checkpointing schemes published i...
Luís Moura Silva, João Gabriel Silva
SOSP
2005
ACM
14 years 4 months ago
Fault-scalable Byzantine fault-tolerant services
A fault-scalable service can be configured to tolerate increasing numbers of faults without significant decreases in performance. The Query/Update (Q/U) protocol is a new tool t...
Michael Abd-El-Malek, Gregory R. Ganger, Garth R. ...
CCGRID
2006
IEEE
14 years 1 months ago
MPI-Mitten: Enabling Migration Technology in MPI
Group communications are commonly used in parallel and distributed environment. However, existing migration mechanisms do not support group communications. This weakness prevents ...
Cong Du, Xian-He Sun
GPC
2007
Springer
14 years 1 months ago
Fault Management in P2P-MPI
We present in this paper the recent developments done in P2P-MPI, a grid middleware, concerning the fault management, which covers fault-tolerance for applications and fault detect...
Stéphane Genaud, Choopan Rattanapoka
ISPDC
2006
IEEE
14 years 1 months ago
MojaveFS: Providing Sequential Consistency in a Distributed Objects System
This paper presents MojaveFS, a distributed file system with support for sequential consistency. It provides location transparency and makes use of replication for reliability an...
Cristian Tapus, David A. Noblet, Vlad Grama, Jason...