Sciweavers

COMSNETS
2012

Varanus: More-with-less fault localization in data centers

12 years 7 months ago
Varanus: More-with-less fault localization in data centers
Abstract—Detecting and localizing performance faults is crucial for operating large enterprise data centers. This problem is relatively straightforward to solve if each entity (applications, servers, business processes) within the data center can be instrumented and monitored explicitly. Unfortunately, such instrument-everything approach is often not tenable because of the limits imposed by enterprises on the permissible amounts of instrumentation intrusiveness and monitoring overhead. In this paper, we address the problem of achieving high accuracy of detecting and localizing performance faults in data centers, while minimizing the required instrumentation intrusiveness and overhead. We present novel algorithms for solving three key subproblems: (1) How many monitors are required and where should they be placed within the data center? (2) Given the proposed instrumentation plan, how to detect the existence of performance faults accurately? and (3) How to localize the root-cause of t...
Vaishali P. Sadaphal, Maitreya Natu, Harrick M. Vi
Added 20 Apr 2012
Updated 20 Apr 2012
Type Journal
Year 2012
Where COMSNETS
Authors Vaishali P. Sadaphal, Maitreya Natu, Harrick M. Vin, Prashant J. Shenoy
Comments (0)