Sciweavers

» Collective Error Detection for MPI Collective Operations
LISA
2000
Aberrant Behavior Detection in Time Series for Network Monitoring
The open-source software RRDtool and Cricket provide a solution to the problem of collecting, storing, and visualizing service network time series data for the real-time monitorin...
Jake D. Brutlag
WISES
2004
Using Triggers to Find Significant Events During Monitoring of Real-Time Systems
A main prerequisite for the successful development of real-time systems is the capability for debugging and monitoring of these systems. The debugging of real-time systems is mo...
Idriz Smaili
CCGRID
2009
IEEE
Improving Parallel Write by Node-Level Request Scheduling
In a cluster of multiple processors or CPU cores, many processes may run on each compute node. Each process tends to issue contiguous I/O requests for snapshot, checkpointing or s...
Kazuki Ohta, Hiroya Matsuba, Yutaka Ishikawa
CLUSTER
2007
IEEE
A feasibility analysis of power-awareness and energy minimization in modern interconnects for high-performance computing
High-performance computing (HPC) systems consume a significant amount of power, resulting in high operational costs, reduced reliability, and wasting of natural resources. Therefor...
Reza Zamani, Ahmad Afsahi, Ying Qian, V. Carl Hama...
ACSAC
2000
IEEE
A Network Audit System for Host-based Intrusion Detection (NASHID) in Linux
Recent work has shown that conventional operating system audit trails are insufficient to detect low-level network attacks. Because audit trails are typically based upon system c...
Thomas E. Daniels, Eugene H. Spafford