Sciweavers

2400 search results - page 180 / 480
» Systems Failures
Sort
View
100
Voted
DSN
2005
IEEE
15 years 8 months ago
Crash Data Collection: A Windows Case Study
Reliability is a rapidly growing concern in contemporary Personal Computer (PC) industry, both for computer users as well as product developers. To improve dependability, systems ...
Archana Ganapathi, David A. Patterson
222
Voted
ASPLOS
2009
ACM
16 years 3 months ago
Anomaly-based bug prediction, isolation, and validation: an automated approach for software debugging
Software defects, commonly known as bugs, present a serious challenge for system reliability and dependability. Once a program failure is observed, the debugging activities to loc...
Martin Dimitrov, Huiyang Zhou
96
Voted
IPPS
2007
IEEE
15 years 8 months ago
RI2N/UDP: High bandwidth and fault-tolerant network for a PC-cluster based on multi-link Ethernet
PC-clusters with high performance/cost ratio have been one of the typical platforms for high performance computing. To lower costs, Gigabit Ethernet is often used for intercommuni...
Takayuki Okamoto, Shin'ichi Miura, Taisuke Boku, M...
CLOUDCOM
2009
Springer
15 years 5 months ago
Decentralized Service Allocation in a Broker Overlay Based Grid
Abstract. Grid computing is based on coordinated resource sharing in a dynamic environment of multi-institutional virtual organizations. Data exchanges, and service allocation, are...
Abdulrahman Azab, Hein Meling
116
Voted
JCP
2007
104views more  JCP 2007»
15 years 2 months ago
An Integrated Self-Testing Framework for Autonomic Computing Systems
Abstract— As the technologies of autonomic computing become more prevalent, it is essential to develop methodologies for testing their dynamic self-management operations. Self-ma...
Tariq M. King, Alain E. Ramirez, Rodolfo Cruz, Pet...