
GI
2004
Springer


Crash Management for Distributed Parallel Systems


Download www.ti.informatik.uni-frankfurt.de

Abstract: With the growing complexity of parallel architectures, the probability of system failures grows as well. One approach to coping with this problem is self-healing, one of the self-x features of organic computing. Self-healing in this context means that computer clusters should detect and handle failures automatically. This paper presents a self-healing mechanism based on checkpointing, so that a cluster remains operative even if some sites or the connections between them fail. The proposed method has been implemented and tested on the Self Distributing Virtual Machine (SDVM).

Jan Haase, Frank Eschmann


Computing’s Self-x Features | GI 2004 | Parallel Architectures | Self-healing | Theoretical Computer Science


Related Content

» Atomic Broadcast in Asynchronous Crash-Recovery Distributed Systems

» CCS Resource Management in Networked HPC Systems

» Lazy Logging and Prefetch-Based Crash Recovery in Software Distributed Shared Memory System...

» Network Storage Management in Data Grid Environment

» Optimizing crash dump in virtualized environments

» A Fault-Tolerant Scalable Low-Overhead Distributed Garbage Detection Protocol

» On Detecting Termination in the Crash-Recovery Model

» Crash fault detection in celerating environments

» An IT appliance for remote collaborative review of mechanisms of injury to children in mot...

Post Info

Added	01 Jul 2010
Updated	01 Jul 2010
Type	Conference
Year	2004
Where	GI
Authors	Jan Haase, Frank Eschmann
