Why Do Internet Services Fail, and What Can Be Done About It?

In 1986 Jim Gray published his landmark study of the causes of failures of Tandem systems and the techniques Tandem used to prevent such failures [6]. Seventeen years later, Internet services have replaced fault-tolerant servers as the new kid on the 24x7-availability block. Using data from three large-scale Internet services, we analyzed the causes of their failures and the (potential) effectiveness of various techniques for preventing and mitigating service failure. We find that (1) operator error is the largest cause of failures in two of the three services, (2) operator error is the largest contributor to time to repair in two of the three services, (3) configuration errors are the largest category of operator errors, (4) failures in custom-written front-end software are significant, and (5) more extensive online testing and more thoroughly exposing and detecting component failures would reduce failure rates in at least one service. Qualitatively we find that improvement in the ma...

David L. Oppenheimer, Archana Ganapathi, David A. Patterson

Component Failures | Internet Services | Operating System | Operator Errors | USITS 2003 |

Post Info

Added	01 Nov 2010
Updated	01 Nov 2010
Type	Conference
Year	2003
Where	USITS
Authors	David L. Oppenheimer, Archana Ganapathi, David A. Patterson
