On Coordinated Checkpointing in Distributed Systems

SRDS
1999
IEEE

An Adaptive Checkpointing Protocol to Bound Recovery Time with Message Logging

15 years 11 months ago

Download ssrnet.snu.ac.kr

Numerous mathematical approaches have been proposed to determine the optimal checkpoint interval for minimizing total execution time of an application in the presence of failures....

Kuo-Feng Ssu, Bin Yao, W. Kent Fuchs

SRDS
1999
IEEE

Logging and Recovery in Adaptive Software Distributed Shared Memory Systems

15 years 11 months ago

Download www.cacs.louisiana.edu

Software distributed shared memory (DSM) improves the programmability of message-passing machines and workclusters by providing a shared memory abstract (i.e., a coherent global a...

Angkul Kongmunvattana, Nian-Feng Tzeng

ICPP
2009
IEEE

Accelerating Checkpoint Operation by Node-Level Write Aggregation on Multicore Systems

16 years 2 months ago

Download nowlab.cse.ohio-state.edu

Clusters and applications continue to grow in size while their mean time between failure (MTBF) is getting smaller. Checkpoint/Restart is becoming increasingly important for lar...

Xiangyong Ouyang, Karthik Gopalakrishnan, Dhabales...

ICECCS
1997
IEEE

Cache based fault recovery for distributed systems

15 years 11 months ago

Download www.deeds.informatik.tu-darmstadt.de

No cache based techniques for roll-forward fault recovery exist at present. A split-cache approach is proposed that provides efficient support for checkpointing and roll-forward fau...

Avi Mendelson, Neeraj Suri

GRID
2004
Springer

Checkpoint and Restart for Distributed Components in XCAT3

16 years 26 days ago

Download www.extreme.indiana.edu

With the advent of Grid computing, more and more high-end computational resources become available for use to a scientist. While this opens up new avenues for scientific research,...

Sriram Krishnan, Dennis Gannon
