IPMI-based Efficient Notification Framework for Large Scale Cluster Computing

15 years 10 months ago

Download ft.ornl.gov

The demand for an efficient fault tolerance system has led to the development of complex monitoring infrastructure, which in turn has created an overwhelming task of data and event management. The increasing level of details at the hardware and software layer clearly affects the scalability and performance of monitoring and management tools. In this paper, we propose a problem notification framework that directly addresses the issue of monitor scalability. We first present the design and implementation of our step-by-step approach to analyzing, filtering, and classifying the plethora of node statistics. Then, we present experimental results to show that our approach only needs minimal system resource and thus has low overhead. Finally, we introduce our web-based cluster management system that provides hardware controls at both cluster and nodal levels. Key words: Scalability, High-Availability, IPMI.

Chokchai Leangsuksun, Tirumala Rao, Anand Tikoteka

Real-time Traffic

CCGRID 2006 | Complex Monitoring Infrastructure | Distributed And Parallel Computing | Efficient Fault Tolerance | Monitor Scalability |

claim paper

» Communications via SystemsonChips Clustering in LargeScaled Sensor Networks

» A metascalable computing framework for large spatiotemporalscale atomistic simulations

» Efficiently clustering transactional data with weighted coverage density

» Boosting for ModelBased Data Clustering

» Scaling Populations of a Genetic Algorithm for Job Shop Scheduling Problems Using MapReduc...

» Improvement of PowerPerformance Efficiency for HighEnd Computing

» Counting Solution Clusters in Graph Coloring Problems Using Belief Propagation

» Largescale multidimensional document clustering on GPU clusters

Post Info
More Details (n/a)

Added	20 Aug 2010
Updated	20 Aug 2010
Type	Conference
Year	2006
Where	CCGRID
Authors	Chokchai Leangsuksun, Tirumala Rao, Anand Tikotekar, Stephen L. Scott, Richard Libby, Jeffrey S. Vetter, Yung-Chin Fang, Hong Ong

Comments (0)

Sciweavers

IPMI-based Efficient Notification Framework for Large Scale Cluster Computing

CCGRID 2006 | Complex Monitoring Infrastructure | Distributed And Parallel Computing | Efficient Fault Tolerance | Monitor Scalability |

Explore & Download

Productivity Tools

Sciweavers