Sciweavers

» Software Fault Tolerance
IPPS
2007
IEEE
14 years 4 months ago
A Job Pause Service under LAM/MPI+BLCR for Transparent Fault Tolerance
Chao Wang, Frank Mueller, Christian Engelmann, Ste...
SCCC
2007
IEEE
14 years 4 months ago
A Relaxed-Ring for Self-Organising and Fault-Tolerant Peer-to-Peer Networks
There is no doubt about the increase in popularity of decentralised systems over the classical client-server architecture in distributed applications. These systems are developed ...
Boris Mejías, Peter Van Roy
HIPC
2007
Springer
14 years 4 months ago
A Scalable Asynchronous Replication-Based Strategy for Fault Tolerant MPI Applications
As computational clusters increase in size, their mean-time-to-failure reduces. Typically checkpointing is used to minimize the loss of computation. Most checkpointing techniques, ...
John Paul Walters, Vipin Chaudhary