How to Recover Efficiently and Asynchronously when Optimism Fails

14 years 7 months ago

Download www.cse.iitb.ac.in

We propose a new algorithm for recovering asynchronously from failures in a distributed computation. Our algorithm is based on two novel concepts - a fault-tolerant vector clock to maintain causality information in spite of failures, and a history mechanism to detect orphan states and obsolete messages. These two mechanisms together with checkpointing and message-loggingare used to restore the system to a consistent state after a failure of one or more processes. Our algorithm is completely asynchronous. It handles multiple failures, does not assume any message ordering, causes the minimum amount of rollback and restores the maximum recoverable state with low overhead. Earlier optimistic protocols lack one or more of the above properties.

Om P. Damani, Vijay K. Garg

Real-time Traffic

Algorithm | Distributed And Parallel Computing | Fault-tolerant Vector Clock | ICDCS 1996 | Maximum Recoverable State |

claim paper

Post Info
More Details (n/a)

Added	07 Aug 2010
Updated	07 Aug 2010
Type	Conference
Year	1996
Where	ICDCS
Authors	Om P. Damani, Vijay K. Garg

Comments (0)

Sciweavers

How to Recover Efficiently and Asynchronously when Optimism Fails

Algorithm | Distributed And Parallel Computing | Fault-tolerant Vector Clock | ICDCS 1996 | Maximum Recoverable State |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers