Sciweavers

1256 search results - page 42 / 252
» On Coordinated Checkpointing in Distributed Systems
Sort
View
113
Voted
EUROMICRO
2000
IEEE
15 years 8 months ago
A Coordination Architecture for Internet Groupwork
This paper discusses a group coordination architecture to support Internet-wide distributed collaboration in the context of legacy Internet protocols. Group coordination in distri...
Hans-Peter Dommel, J. J. Garcia-Luna-Aceves
128
Voted
ATAL
2009
Springer
15 years 10 months ago
Learning of coordination: exploiting sparse interactions in multiagent systems
Creating coordinated multiagent policies in environments with uncertainty is a challenging problem, which can be greatly simplified if the coordination needs are known to be limi...
Francisco S. Melo, Manuela M. Veloso
112
Voted
NETWORKING
2010
15 years 5 months ago
Efficient Recovery from False State in Distributed Routing Algorithms
Abstract--Malicious and misconfigured nodes can inject incorrect state into a distributed system, which can then be propagated system-wide as a result of normal network operation. ...
Daniel Gyllstrom, Sudarshan Vasudevan, Jim Kurose,...
113
Voted
DAIS
2010
15 years 5 months ago
Distributed Fault Tolerant Controllers
Distributed applications are often built from sets of distributed components that must be co-ordinated in order to achieve some global behaviour. The common approach is to use a c...
Leonardo Mostarda, Rudi Ball, Naranker Dulay
119
Voted
IPPS
2007
IEEE
15 years 10 months ago
Improving MPI Independent Write Performance Using A Two-Stage Write-Behind Buffering Method
Many large-scale production applications often have very long executions times and require periodic data checkpoints in order to save the state of the computation for program rest...
Wei-keng Liao, Avery Ching, Kenin Coloma, Alok N. ...