Sciweavers

57 search results - page 5 / 12
» The fault span of crash failures
Sort
View
IPPS
2007
IEEE
14 years 5 months ago
Tiresias: Black-Box Failure Prediction in Distributed Systems
Faults in distributed systems can result in errors that manifest in several ways, potentially even in parts of the system that are not collocated with the root cause. These manife...
Andrew W. Williams, Soila M. Pertet, Priya Narasim...
DSN
2005
IEEE
14 years 4 months ago
A System Demonstration of ST-TCP
ST-TCP (Server fault-Tolerant TCP) is an extension of TCP to tolerate TCP server failures. Server fault tolerance is provided by using an active-backup server that keeps track of ...
Manish Marwah, Shivakant Mishra, Christof Fetzer
IPPS
2006
IEEE
14 years 4 months ago
Fault injection in distributed Java applications
In a network consisting of several thousands computers, the occurrence of faults is unavoidable. Being able to test the behaviour of a distributed program in an environment where ...
William Hoarau, Sébastien Tixeuil, Fabien V...
DEXAW
2008
IEEE
161views Database» more  DEXAW 2008»
14 years 5 months ago
Model-Based QoS-Enabled Self-Healing Web Services
Failures during web service execution may depend on a wide variety of causes, such as network faults, server crashes, or application-related errors, such as unavailability of a re...
Olga Nabuco, Riadh Ben Halima, Khalil Drira, Maria...
RV
2010
Springer
172views Hardware» more  RV 2010»
13 years 9 months ago
Recovery Tasks: An Automated Approach to Failure Recovery
Abstract. We present a new approach for developing robust software applications that breaks dependences on the failed parts of an application’s execution to allow the rest of the...
Brian Demsky, Jin Zhou, William Montaz