Sciweavers

PODC
2010
ACM

Adaptive system anomaly prediction for large-scale hosting infrastructures

Large-scale hosting infrastructures require automatic system anomaly management to achieve continuous system operation. In this paper, we present a novel adaptive runtime anomaly prediction system, called ALERT, to achieve robust hosting infrastructures. In contrast to traditional anomaly detection schemes, ALERT aims at raising advance anomaly alerts to achieve just-in-time anomaly prevention. We propose a novel context-aware anomaly prediction scheme to improve prediction accuracy in dynamic hosting infrastructures. We have implemented the ALERT system and deployed it on several production hosting infrastructures, such as the IBM System S stream processing cluster and PlanetLab. Our experiments show that ALERT can achieve high prediction accuracy for a range of system anomalies and imposes low overhead on the hosting infrastructure.
Categories and Subject Descriptors: C.4 [Performance of Systems]: Reliability, availability, and serviceability
General Terms: Reliability, Management, Experime...
Yongmin Tan, Xiaohui Gu, Haixun Wang
Added 16 Aug 2010
Updated 16 Aug 2010
Type Conference
Year 2010
Where PODC