Sciweavers

43 search results - page 4 / 9
» Transparent disconnected operation for fault-tolerance
Sort
View
DAIS
2006
14 years 8 days ago
Fault-Tolerant Replication Based on Fragmented Objects
This paper describes a novel approach to fault-tolerance in distributed object-based systems. It uses the fragmented-object model to integrate replication mechanisms into distribut...
Hans P. Reiser, Rüdiger Kapitza, Jörg Do...
ICS
2007
Tsinghua U.
14 years 5 months ago
Proactive fault tolerance for HPC with Xen virtualization
Large-scale parallel computing is relying increasingly on clusters with thousands of processors. At such large counts of compute nodes, faults are becoming common place. Current t...
Arun Babu Nagarajan, Frank Mueller, Christian Enge...
CLUSTER
2003
IEEE
14 years 4 months ago
Coordinated Checkpoint versus Message Log for Fault Tolerant MPI
— Large Clusters, high availability clusters and Grid deployments often suffer from network, node or operating system faults and thus require the use of fault tolerant programmin...
Aurelien Bouteiller, Pierre Lemarinier, Gér...
IPPS
2002
IEEE
14 years 3 months ago
Disconnected Operations in Mobile Environments
The execution of distributed applications involving mobile terminals and fixed servers connected by wireless links raises the need for handling network disconnections, both invol...
Denis Conan, Sophie Chabridon, Guy Bernard
SIGOPSE
1990
ACM
14 years 2 months ago
Transparent disconnected operation for fault-tolerance
James J. Kistler, Mahadev Satyanarayanan