Sciweavers

104 search results - page 10 / 21
» A Framework for Node-Level Fault Tolerance in Distributed Re...
HICSS
1994
IEEE
13 years 11 months ago
Fault-Tolerance for Communicating Multidatabase Transactions
In this paper we propose a framework that can be used for the implementation of reliable multidatabase system communication protocols. The framework provides transactions on shar...
eva Kühn
IPPS
2007
IEEE
14 years 1 months ago
The Design and Implementation of Checkpoint/Restart Process Fault Tolerance for Open MPI
To be able to fully exploit ever larger computing platforms, modern HPC applications and system software must be able to tolerate inevitable faults. Historically, MPI implementati...
Joshua Hursey, Jeffrey M. Squyres, Timothy Mattox,...
IPPS
1999
IEEE
13 years 11 months ago
An Adaptive, Fault-Tolerant Implementation of BSP for JAVA-Based Volunteer Computing Systems
In recent years, there has been a surge of interest in Java-based volunteer computing systems, which aim to make it possible to build very large parallel computing network...
Luis F. G. Sarmenta
ISORC
2007
IEEE
14 years 1 months ago
On Rigorous Design and Implementation of Fault Tolerant Ambient Systems
Developing fault tolerant ambient systems requires many challenging factors to be considered due to the nature of such systems, which tend to contain a lot of mobile elements that...
Alexei Iliasov, Alexander Romanovsky, Budi Arief, ...
USENIX
2008
13 years 9 months ago
Improving Scalability and Fault Tolerance in an Application Management Infrastructure
This paper explores the challenges associated with distributed application management in large-scale computing environments. In particular, we investigate several techniques for e...
Nikolay Topilski, Jeannie R. Albrecht, Amin Vahdat