Search Sciweavers | Sciweavers

342 search results - page 12 / 69

» A planning based approach to failure recovery in distributed...

click to vote

SRDS
1994
IEEE

120views Operating System» more SRDS 1994»

Coordinated Checkpointing-Rollback Error Recovery for Distributed Shared Memory Multicomputers

13 years 11 months ago

Download fmdb.cs.ucla.edu

Most recovery schemes that have been proposed for Distributed Shared Memory (DSM) systems require unnecessarily high checkpointing frequency and checkpoint traffic, which are sens...

G. Janakiraman, Yuval Tamir

claim paper

Read More »

click to vote

SRDS
2008
IEEE

100views Operating System» more SRDS 2008»

Dynamically Quantifying and Improving the Reliability of Distributed Storage Systems

14 years 1 months ago

Download vivo.cs.rutgers.edu

In this paper, we argue that the reliability of large-scale storage systems can be signiﬁcantly improved by using better reliability metrics and more efﬁcient policies for rec...

Rekha Bachwani, Leszek Gryz, Ricardo Bianchini, Ce...

claim paper

Read More »

click to vote

GECCO
2005
Springer

155views Optimization» more GECCO 2005»

A pareto archive evolutionary strategy based radial basis function neural network training algorithm for failure rate prediction

14 years 1 months ago

Download www.cs.bham.ac.uk

This paper outlines a radial basis function neural network approach to predict the failures in overhead distribution lines of power delivery systems. The RBF networks are trained ...

Grant Cochenour, Jerad Simon, Sanjoy Das, Anil Pah...

claim paper

Read More »

click to vote

NSDI
2004

119views Computer Networks» more NSDI 2004»

Path-Based Failure and Evolution Management

13 years 9 months ago

Download www.cs.berkeley.edu

We present a new approach to managing failures and evolution in large, complex distributed systems using runtime paths. We use the paths that requests follow as e through the syst...

Mike Y. Chen, Anthony Accardi, Emre Kiciman, David...

claim paper

Read More »

click to vote

PVM
2010
Springer

123views Distributed And Parallel Com...» more PVM 2010»

Dodging the Cost of Unavoidable Memory Copies in Message Logging Protocols

13 years 6 months ago

Download icl.cs.utk.edu

Abstract. With the number of computing elements spiraling to hundred of thousands in modern HPC systems, failures are common events. Few applications are nevertheless fault toleran...

George Bosilca, Aurelien Bouteiller, Thomas H&eacu...

claim paper

Read More »

« Prev « First page 12 / 69 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers