Search Sciweavers | Sciweavers

ICS
2004
Tsinghua U.

140 views, Distributed And Parallel Com...

Adaptive incremental checkpointing for massively parallel systems

14 years 1 month ago

Download www.eecs.harvard.edu

Given the scale of massively parallel systems, occurrence of faults is no longer an exception but a regular event. Periodic checkpointing is becoming increasingly important in the...

Saurabh Agarwal, Rahul Garg, Meeta Sharma Gupta, J...

ICS
2011
Tsinghua U.

278 views, Distributed And Parallel Com...

High performance Linpack benchmark: a fault tolerant implementation without checkpointing

12 years 11 months ago

Download inside.mines.edu

The probability that a failure will occur before the end of the computation increases as the number of processors used in a high performance computing application increases. For l...

Teresa Davies, Christer Karlsson, Hui Liu, Chong D...

SC
2009
ACM

254 views, Applied Computing

FALCON: a system for reliable checkpoint recovery in shared grid environments

14 years 2 months ago

Download cobweb.ecn.purdue.edu

In Fine-Grained Cycle Sharing (FGCS) systems, machine owners voluntarily share their unused CPU cycles with guest jobs, as long as the performance degradation is tolerable. For gu...

Tanzima Zerin Islam, Saurabh Bagchi, Rudolf Eigenm...

VEE
2006
ACM

126 views, Virtualization

A new approach to real-time checkpointing

14 years 1 month ago

Download www.cs.purdue.edu

The progress towards programming methodologies that simplify the work of the programmer involves automating, whenever possible, activities that are secondary to the main task of d...

Antonio Cunei, Jan Vitek

SIGSOFT
2007
ACM

138 views, Software Engineering

Efficient checkpointing of Java software using context-sensitive capture and replay

14 years 8 months ago

Download www.cse.ohio-state.edu

Checkpointing and replaying is an attractive technique that has been used widely at the operating/runtime system level to provide fault tolerance. Applying such a technique at the...

Guoqing Xu, Atanas Rountev, Yan Tang, Feng Qin
