Search Sciweavers | Sciweavers

1256 search results - page 3 / 252

» On Coordinated Checkpointing in Distributed Systems

218

Voted

ICDCS
2005
IEEE

116views Distributed And Parallel Com...» more ICDCS 2005»

Optimal Asynchronous Garbage Collection for RDT Checkpointing Protocols

16 years 1 months ago

Download www.cecs.uci.edu

Communication-induced checkpointing protocols that ensure rollback-dependency trackability (RDT) guarantee important properties to the recovery system without explicit coordinatio...

Rodrigo Schmidt, Islene C. Garcia, Fernando Pedone...

claim paper

Read More »

216

click to vote

CLUSTER
2004
IEEE

180views Distributed And Parallel Com...» more CLUSTER 2004»

Improved message logging versus improved coordinated checkpointing for fault tolerant MPI

15 years 11 months ago

Download www.cs.utk.edu

Fault tolerance is a very important concern for critical high performance applications using the MPI library. Several protocols provide automatic and transparent fault detection a...

Pierre Lemarinier, Aurelien Bouteiller, Thomas H&e...

claim paper

Read More »

188

click to vote

PVLDB
2008

110views more PVLDB 2008»

Fault-tolerant stream processing using a distributed, replicated file system

15 years 6 months ago

Download www.cs.washington.edu

We present SGuard, a new fault-tolerance technique for distributed stream processing engines (SPEs) running in clusters of commodity servers. SGuard is less disruptive to normal s...

YongChul Kwon, Magdalena Balazinska, Albert G. Gre...

claim paper

Read More »

196

Voted

IPPS
1999
IEEE

126views Distributed And Parallel Com...» more IPPS 1999»

The Performance of Coordinated and Independent Checkpointing

15 years 11 months ago

Download ipdps.cc.gatech.edu

Checkpointing is a very effective technique to tolerate the occurrence of failures in distributed and parallel applications. The existing algorithms in the literature are basicall...

Luís Moura Silva, João Gabriel Silva

claim paper

Read More »

210

Voted

IPPS
2006
IEEE

106views Distributed And Parallel Com...» more IPPS 2006»

Coordinated checkpoint from message payload in pessimistic sender-based message logging

16 years 1 months ago

Download www.cecs.uci.edu

Execution of MPI applications on Clusters and Grid deployments suffers from node and network failure that motivates the use of fault tolerant MPI implementations. Two category tec...

M. Aminian, Mohammad K. Akbari, Bahman Javadi

claim paper

Read More »

« Prev « First page 3 / 252 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers