Sciweavers

» Software Implemented Fault Tolerance Technologies and Experi...
CCGRID
2006
IEEE
14 years 1 month ago
MPI-Mitten: Enabling Migration Technology in MPI
Group communications are commonly used in parallel and distributed environments. However, existing migration mechanisms do not support group communications. This weakness prevents ...
Cong Du, Xian-He Sun
SAC
2008
ACM
13 years 7 months ago
Providing dependability for web services
Web services have been widely employed to allow interoperability among applications and/or technologies. However, the standard technologies and protocols which provide the foundat...
Jeferson L. R. Souza, Frank Siqueira
SC
2005
ACM
14 years 1 month ago
Transparent, Incremental Checkpointing at Kernel Level: a Foundation for Fault Tolerance for Parallel Computers
We describe the software architecture, technical features, and performance of TICK (Transparent Incremental Checkpointer at Kernel level), a system-level checkpointer implemented ...
Roberto Gioiosa, José Carlos Sancho, Song J...
IEEESCC
2008
IEEE
14 years 1 month ago
A Fault Tolerance Approach for Enterprise Applications
Service Oriented Architectures (SOAs) have emerged as a preferred solution to tackle the complexity of large-scale, complex, distributed, and heterogeneous systems. Key to success...
Vina Ermagan, Ingolf Krüger, Massimiliano Men...
DSN
2006
IEEE
14 years 1 month ago
Automatic Instruction-Level Software-Only Recovery
As chip densities and clock rates increase, processors are becoming more susceptible to transient faults that can affect program correctness. Computer architects have typically ad...
Jonathan Chang, George A. Reis, David I. August