Sciweavers

441 search results - page 14 / 89
» Generic Timing Fault Tolerance using a Timely Computing Base
Sort
View
AINA
2004
IEEE
14 years 14 days ago
Region-based Stage Construction Protocol for Fault tolerant Execution of Mobile Agent
Fault tolerance is essential to the development of reliable mobile agent system in order to guarantee continuous execution of mobile agents. For this purpose, some previous works ...
SungJin Choi, MaengSoon Baik, HongSoo Kim, JunWeon...
CLUSTER
2004
IEEE
14 years 14 days ago
FTC-Charm++: an in-memory checkpoint-based fault tolerant runtime for Charm++ and MPI
As high performance clusters continue to grow in size, the mean time between failure shrinks. Thus, the issues of fault tolerance and reliability are becoming one of the challengi...
Gengbin Zheng, Lixia Shi, Laxmikant V. Kalé
ESCIENCE
2007
IEEE
14 years 3 months ago
Intelligent Selection of Fault Tolerance Techniques on the Grid
The emergence of computational grids has lead to an increased reliance on task schedulers that can guarantee the completion of tasks that are executed on unreliable systems. There...
Daniel C. Vanderster, Nikitas J. Dimopoulos, Randa...
HPCC
2010
Springer
13 years 9 months ago
A Generic Execution Management Framework for Scientific Applications
Managing the execution of scientific applications in a heterogeneous grid computing environment can be a daunting task, particularly for long running jobs. Increasing fault tolera...
Tanvire Elahi, Cameron Kiddle, Rob Simmonds
ICS
2011
Tsinghua U.
13 years 5 days ago
High performance linpack benchmark: a fault tolerant implementation without checkpointing
The probability that a failure will occur before the end of the computation increases as the number of processors used in a high performance computing application increases. For l...
Teresa Davies, Christer Karlsson, Hui Liu, Chong D...