Sciweavers

2212 search results - page 4 / 443
» Coordinating Open Distributed Systems
Sort
View
HPDC
2009
IEEE
14 years 2 months ago
Interconnect agnostic checkpoint/restart in open MPI
Long running High Performance Computing (HPC) applications at scale must be able to tolerate inevitable faults if they are to harness current and future HPC systems. Message Passi...
Joshua Hursey, Timothy Mattox, Andrew Lumsdaine
COORDINATION
2005
Springer
14 years 1 months ago
Coordination Systems in Role-Based Adaptive Software
Software systems are becoming more open, distributed, pervasive, and connected. In such systems, the relationships between loosely-coupled application elements become non-determini...
Alan W. Colman, Jun Han
ICPP
2009
IEEE
14 years 2 months ago
CIFTS: A Coordinated Infrastructure for Fault-Tolerant Systems
—Considerable work has been done on providing fault tolerance capabilities for different software components on largescale high-end computing systems. Thus far, however, these fa...
Rinku Gupta, Pete Beckman, Byung-Hoon Park, Ewing ...
MIDDLEWARE
2009
Springer
14 years 2 months ago
Achieving Coordination through Dynamic Construction of Open Workflows
Louis Thomas, Justin Wilson, Gruia-Catalin Roman, ...