Sciweavers

106 search results - page 15 / 22
» Transparent Fault Tolerance for Grid Applications
Sort
View
ESCIENCE
2005
IEEE
14 years 2 months ago
The GriddLeS Data Replication Service
The Grid provides infrastructure that allows an arbitrary application to be executed on a range of different computational resources. When input files are very large, or when faul...
Tim Ho, David Abramson
DSE
1998
80views more  DSE 1998»
13 years 8 months ago
The Voltan application programming environment for fail-silent processes
The Voltan software library for building distributed applications provides the support for (i) a processpair to act as single Voltan self-checking ‘fail-silent’ process; and (...
Dave Black, C. Low, Santosh K. Shrivastava
ESCIENCE
2006
IEEE
14 years 2 months ago
A Unified Data Grid Replication Framework
Modern scientific experiments can generate large amounts of data, which may be replicated and distributed across multiple resources to improve application performance and fault to...
Tim Ho, David Abramson
HPDC
2009
IEEE
14 years 3 months ago
Interconnect agnostic checkpoint/restart in open MPI
Long running High Performance Computing (HPC) applications at scale must be able to tolerate inevitable faults if they are to harness current and future HPC systems. Message Passi...
Joshua Hursey, Timothy Mattox, Andrew Lumsdaine
GRID
2004
Springer
14 years 2 months ago
Checkpoint and Restart for Distributed Components in XCAT3
With the advent of Grid computing, more and more highend computational resources become available for use to a scientist. While this opens up new avenues for scientific research,...
Sriram Krishnan, Dennis Gannon