Sciweavers

342 search results - page 27 / 69
» A planning based approach to failure recovery in distributed...
Sort
View
SRDS
2006
IEEE
14 years 2 months ago
Proactive Resilience Revisited: The Delicate Balance Between Resisting Intrusions and Remaining Available
In a recent paper, we presented proactive resilience as a new approach to proactive recovery, based on architectural hybridization. We showed that, with appropriate assumptions ab...
Paulo Sousa, Nuno Ferreira Neves, Paulo Verí...
INFOCOM
2012
IEEE
11 years 11 months ago
Sherlock is around: Detecting network failures with local evidence fusion
—Traditional approaches for wireless sensor network diagnosis are mainly sink-based. They actively collect global evidences from sensor nodes to the sink so as to conduct central...
Qiang Ma, Kebin Liu, Xin Miao, Yunhao Liu
DAIS
2006
13 years 10 months ago
Using Speculative Push for Unnecessary Checkpoint Creation Avoidance
Abstract. This paper discusses a way of incorporating speculation techniques into Distributed Shared Memory (DSM) systems with checkpointing mechanism without creating unnecessary ...
Arkadiusz Danilecki, Michal Szychowiak
SEKE
2010
Springer
13 years 7 months ago
Distributed and Adaptive Execution of Condor DAGMan Workflows
— Large-scale applications, in the form of workflows, may require the coordinated usage of resources spreading across multiple administrative domains. Scalable solutions need a d...
Selim Kalayci, Gargi Dasgupta, Liana Fong, Onyeka ...
CCGRID
2010
IEEE
13 years 10 months ago
Team-Based Message Logging: Preliminary Results
Fault tolerance will be a fundamental imperative in the next decade as machines containing hundreds of thousands of cores will be installed at various locations. In this context, ...
Esteban Meneses, Celso L. Mendes, Laxmikant V. Kal...