Sciweavers

» Systems Failures
SRDS
2008
IEEE
14 years 3 months ago
Dynamically Quantifying and Improving the Reliability of Distributed Storage Systems
In this paper, we argue that the reliability of large-scale storage systems can be significantly improved by using better reliability metrics and more efficient policies for rec...
Rekha Bachwani, Leszek Gryz, Ricardo Bianchini, Ce...
CBSE
2005
Springer
14 years 2 months ago
Tailored Responsibility Within Component-Based Systems
The concept of responsibility aims at making a computing system trustworthy for its users despite the fact that failures of IT systems cannot be completely excluded. The presented ...
Elke Franz, Ute Wappler
MMAS
2004
Springer
14 years 2 months ago
Towards Fault-Tolerant Massively Multiagent Systems
In order to construct and deploy massively multiagent systems, we must address one of the fundamental issues of distributed systems, the possibility of partial failures. ...
Zahia Guessoum, Jean-Pierre Briot, Nora Faci
IPPS
2000
IEEE
14 years 1 months ago
Network Survivability Simulation of a Commercially Deployed Dynamic Routing System Protocol
With the ever-increasing demands on server applications, many new server services are distributed in nature. We evaluated one hundred deployed systems and found that over a one-yea...
Abdur Chowdhury, Ophir Frieder, Paul Luse, Peng-Ju...
ASPLOS
2011
ACM
13 years 19 days ago
Improving software diagnosability via log enhancement
Diagnosing software failures in the field is notoriously difficult, in part due to the fundamental complexity of trouble-shooting any complex software system, but further exacer...
Ding Yuan, Jing Zheng, Soyeon Park, Yuanyuan Zhou,...