Sciweavers

106 search results - page 10 / 22
» Transparent Fault Tolerance for Grid Applications
Sort
View
140
Voted
DSN
2003
IEEE
15 years 10 months ago
Design and Implementation of a Consistent Time Service for Fault-Tolerant Distributed Systems
Clock-related operations are one of the many sources of replica non-determinism and of replica inconsistency in fault-tolerant distributed systems. In passive replication, if the ...
Wenbing Zhao, Louise E. Moser, P. M. Melliar-Smith
DSN
2003
IEEE
15 years 10 months ago
Engineering Fault-Tolerant TCP/IP Servers Using FT-TCP
In a recent paper [2] we have proposed FT-TCP: an architecture that allows a replicated service to survive crashes without breaking its TCP connections. FT-TCP is attractive in pr...
Dmitrii Zagorodnov, Keith Marzullo, Lorenzo Alvisi...
DAIS
2006
15 years 6 months ago
Fault-Tolerant Replication Based on Fragmented Objects
This paper describes a novel approach to fault-tolerance in distributed object-based systems. It uses the fragmented-object model to integrate replication mechanisms into distribut...
Hans P. Reiser, Rüdiger Kapitza, Jörg Do...
142
Voted
DSN
2003
IEEE
15 years 10 months ago
TCP Server Fault Tolerance Using Connection Migration to a Backup Server
This paper describes the design, implementation, and performance evaluation of ST-TCP (Server fault-Tolerant TCP), which is an extension of TCP to tolerate TCP server failures. Th...
Manish Marwah, Shivakant Mishra, Christof Fetzer
CLUSTER
2004
IEEE
15 years 8 months ago
Improved message logging versus improved coordinated checkpointing for fault tolerant MPI
Fault tolerance is a very important concern for critical high performance applications using the MPI library. Several protocols provide automatic and transparent fault detection a...
Pierre Lemarinier, Aurelien Bouteiller, Thomas H&e...