Sciweavers

354 search results - page 5 / 71
» Self Adaptive Application Level Fault Tolerance for Parallel...
Sort
View
ICPP
2009
IEEE
14 years 3 months ago
CIFTS: A Coordinated Infrastructure for Fault-Tolerant Systems
—Considerable work has been done on providing fault tolerance capabilities for different software components on largescale high-end computing systems. Thus far, however, these fa...
Rinku Gupta, Pete Beckman, Byung-Hoon Park, Ewing ...
CLUSTER
2004
IEEE
13 years 8 months ago
MPI/FT: A Model-Based Approach to Low-Overhead Fault Tolerant Message-Passing Middleware
Fault tolerance in parallel systems has traditionally been achieved through a combination of redundancy and checkpointing methods. This notion has also been extended to message-pas...
Rajanikanth Batchu, Yoginder S. Dandass, Anthony S...
CLUSTER
2006
IEEE
14 years 2 months ago
FAIL-MPI: How Fault-Tolerant Is Fault-Tolerant MPI?
One of the topics of paramount importance in the development of Cluster and Grid middleware is the impact of faults since their occurrence in Grid infrastructures and in large-sca...
William Hoarau, Pierre Lemarinier, Thomas Hé...
IPPS
2002
IEEE
14 years 1 months ago
Design and Implementation of a Pluggable Fault Tolerant CORBA Infrastructure
In this paper we present the design and implementation of a Pluggable Fault Tolerant CORBA Infrastructure that provides fault tolerance for CORBA applications by utilizing the plu...
Wenbing Zhao, Louise E. Moser, P. M. Melliar-Smith
MIDDLEWARE
2000
Springer
14 years 4 days ago
Gateways for Accessing Fault Tolerance Domains
Enterprise applications can be structured as domains, where each domain contains objects that are replicated for fault tolerance, with the replication being managed by a fault tole...
Priya Narasimhan, Louise E. Moser, P. M. Melliar-S...