Sciweavers

1256 search results - page 42 / 252
» On Coordinated Checkpointing in Distributed Systems
Sort
View
EUROMICRO
2000
IEEE
14 years 1 months ago
A Coordination Architecture for Internet Groupwork
This paper discusses a group coordination architecture to support Internet-wide distributed collaboration in the context of legacy Internet protocols. Group coordination in distri...
Hans-Peter Dommel, J. J. Garcia-Luna-Aceves
ATAL
2009
Springer
14 years 3 months ago
Learning of coordination: exploiting sparse interactions in multiagent systems
Creating coordinated multiagent policies in environments with uncertainty is a challenging problem, which can be greatly simplified if the coordination needs are known to be limi...
Francisco S. Melo, Manuela M. Veloso
NETWORKING
2010
13 years 10 months ago
Efficient Recovery from False State in Distributed Routing Algorithms
Abstract--Malicious and misconfigured nodes can inject incorrect state into a distributed system, which can then be propagated system-wide as a result of normal network operation. ...
Daniel Gyllstrom, Sudarshan Vasudevan, Jim Kurose,...
DAIS
2010
13 years 10 months ago
Distributed Fault Tolerant Controllers
Distributed applications are often built from sets of distributed components that must be co-ordinated in order to achieve some global behaviour. The common approach is to use a c...
Leonardo Mostarda, Rudi Ball, Naranker Dulay
IPPS
2007
IEEE
14 years 3 months ago
Improving MPI Independent Write Performance Using A Two-Stage Write-Behind Buffering Method
Many large-scale production applications often have very long executions times and require periodic data checkpoints in order to save the state of the computation for program rest...
Wei-keng Liao, Avery Ching, Kenin Coloma, Alok N. ...