Sciweavers

148 search results - page 1 / 30
DMSN
2006
ACM
14 years 1 month ago
Intelligent system monitoring on large clusters
Modern data centers have a large number of components that must be monitored, including servers, switches/routers, and environmental control systems. This paper describes InteMon,...
Jimeng Sun, Evan Hoke, John D. Strunk, Gregory R. ...
CLUSTER
2004
IEEE
13 years 11 months ago
NWPerf: a system wide performance monitoring tool for large Linux clusters
Ryan W. Mooney, Ken P. Schmidt, R. Scott Studham
IPPS
2005
IEEE
14 years 1 month ago
Monitoring and Debugging Parallel Software with BCS-MPI on Large-Scale Clusters
Buffered CoScheduled (BCS) MPI is a novel implementation of MPI based on global synchronization of all system activities. BCS-MPI imposes a model where all processes and their com...
Juan Fernández, Fabrizio Petrini, Eitan Fra...
CCGRID
2006
IEEE
13 years 11 months ago
IPMI-based Efficient Notification Framework for Large Scale Cluster Computing
The demand for an efficient fault tolerance system has led to the development of complex monitoring infrastructure, which in turn has created an overwhelming task of data and even...
Chokchai Leangsuksun, Tirumala Rao, Anand Tikoteka...