failure prediction | Sciweavers

BERTINORO
2005
Springer

Prediction-Based Software Availability Enhancement

Download www.rok.informatik.hu-berlin.de

We propose a new paradigm for software availability enhancement. We offer a two-step strategy: Failure prediction followed by maintenance actions with the objective of avoiding imp...

Felix Salfner, Günther A. Hoffmann, Miroslaw ...

IPPS
2005
IEEE

Proactive Fault Handling for System Availability Enhancement

Download www2.informatik.hu-berlin.de

Proactive fault handling combines prevention and repair actions with failure prediction techniques. We extend the standard availability formula by five key measures: (1) precisio...

Felix Salfner, Miroslaw Malek

DSN
2006
IEEE

BlueGene/L Failure Analysis and Prediction Models

Download www.ece.rutgers.edu

The growing computational and storage needs of several scientific applications mandate the deployment of extreme-scale parallel machines, such as IBM’s BlueGene/L which can acc...

Yinglung Liang, Yanyong Zhang, Anand Sivasubramani...

CCGRID
2006
IEEE

Exploit Failure Prediction for Adaptive Fault-Tolerance in Cluster Computing

Download www.cs.iit.edu

As the scale of cluster computing grows, it is becoming hard for long-running applications to complete without facing failures on large-scale clusters. To address this issue, chec...

Yawei Li, Zhiling Lan

SRDS
2007
IEEE

Using Hidden Semi-Markov Models for Effective Online Failure Prediction

Download www.srds2007.org

A proactive handling of faults requires that the risk of upcoming failures is continuously assessed. One of the promising approaches is online failure prediction, which means that...

Felix Salfner, Miroslaw Malek

ICPP
2007
IEEE

Fault-Driven Re-Scheduling For Improving System-level Fault Resilience

Download www.cs.iit.edu

The productivity of HPC systems is determined not only by their performance, but also by their reliability. The conventional method to limit the impact of failures is checkpointing...

Yawei Li, Prashasta Gujrati, Zhiling Lan, Xian-He ...

ANSS
2007
IEEE

Failure Prediction in Computational Grids

Download www.cs.virginia.edu

Accurate failure prediction in Grids is critical for reasoning about QoS guarantees such as job completion time and availability. Statistical methods can be used but they suffer f...

Woochul Kang, Andrew S. Grimshaw

ICPP
2008
IEEE

Dynamic Meta-Learning for Failure Prediction in Large-Scale Systems: A Case Study

Download www.cs.iit.edu

Despite great efforts on the design of ultra-reliable components, the increase of system size and complexity has outpaced the improvement of component reliability. As a result, fa...

Jiexing Gu, Ziming Zheng, Zhiling Lan, John White,...

DSN
2009
IEEE

System log pre-processing to improve failure prediction

Download www.cs.iit.edu

Log preprocessing, a process applied on the raw log before applying a predictive method, is of paramount importance to failure prediction and diagnosis. While existing filtering ...

Ziming Zheng, Zhiling Lan, Byung-Hoon Park, Al Gei...
