GRID
2008
Springer

Troubleshooting thousands of jobs on production grids using data mining techniques

Large-scale production computing grids introduce new challenges in debugging and troubleshooting. A user who submits a workload of tens of thousands of jobs to a grid of thousands of processors has a good chance of receiving thousands of error messages as a result. How can one begin to reason about such problems? We propose that data mining techniques can be employed to classify failures according to the properties of the jobs and machines involved. We demonstrate the success of this technique through several case studies on real workloads consisting of tens of thousands of jobs. We apply the same techniques to a year’s worth of data from a 3,000-CPU production grid and use them to gain a high-level understanding of the system's behavior.

David A. Cieslak, Nitesh V. Chawla, Douglas Thain
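
As a rough illustration of what "classifying failures according to the properties of the jobs and machines involved" can look like, the sketch below ranks attributes of job records by information gain with respect to the job outcome. This is not the authors' method or code: the abstract does not name a specific algorithm, and the record fields (`os`, `memory`, `outcome`) and the toy data are hypothetical, chosen only to make the example self-contained.

```python
from collections import Counter, defaultdict
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    counts = Counter(labels)
    return -sum((c / total) * log2(c / total) for c in counts.values())


def information_gain(records, attribute, label_key="outcome"):
    """Reduction in outcome entropy after splitting records on one attribute."""
    labels = [r[label_key] for r in records]
    groups = defaultdict(list)
    for r in records:
        groups[r[attribute]].append(r[label_key])
    # Weighted average entropy of the groups produced by the split.
    remainder = sum(len(g) / len(records) * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


def rank_attributes(records, attributes, label_key="outcome"):
    """Order attributes from most to least informative about the outcome."""
    return sorted(
        attributes,
        key=lambda a: information_gain(records, a, label_key),
        reverse=True,
    )


# Hypothetical job records: each combines machine properties with the job outcome.
jobs = [
    {"os": "linux", "memory": "high", "outcome": "ok"},
    {"os": "linux", "memory": "low", "outcome": "ok"},
    {"os": "linux", "memory": "high", "outcome": "ok"},
    {"os": "solaris", "memory": "high", "outcome": "failed"},
    {"os": "solaris", "memory": "low", "outcome": "failed"},
    {"os": "linux", "memory": "low", "outcome": "ok"},
]

ranking = rank_attributes(jobs, ["os", "memory"])
print(ranking)
```

In this toy data every failure occurs on one operating system, so `os` ranks above `memory`. A real workload would have many more attributes and noisier outcomes, and would likely call for a full classifier rather than a single-attribute ranking.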

CPU Production Grid | Data Mining Techniques | Distributed And Parallel Computing | GRID 2008 | Production Computing Grids

Added	09 Nov 2010
Updated	09 Nov 2010
Type	Conference
Year	2008
Where	GRID
Authors	David A. Cieslak, Nitesh V. Chawla, Douglas Thain