Search Sciweavers | Sciweavers

37

CLUSTER
2006
IEEE

152views Distributed And Parallel Com...» more CLUSTER 2006»

JOSHUA: Symmetric Active/Active Replication for Highly Available HPC Job and Resource Management

14 years 3 months ago

Most of today‘s HPC systems employ a single head node for control, which represents a single point of failure as it interrupts an entire HPC system upon failure. Furthermore, it...

Kai Uhlemann, Christian Engelmann, Stephen L. Scot...

claim paper

Read More »

25

click to vote

ICDCS
2010
IEEE

141views Distributed And Parallel Com...» more ICDCS 2010»

Minimizing Probing Cost and Achieving Identifiability in Network Link Monitoring

13 years 10 months ago

Download mcn.cse.psu.edu

Continuously monitoring the link performance is important to network diagnosis. Recently, active probes sent between end systems are widely used to monitor the link performance. I...

Qiang Zheng, Guohong Cao

claim paper

Read More »

22

click to vote

IPPS
2007
IEEE

91views Distributed And Parallel Com...» more IPPS 2007»

Detecting Runtime Environment Interference with Parallel Application Behavior

14 years 3 months ago

Download www.cecs.uci.edu

Many performance problems observed in high end systems are actually caused by the runtime system and not the application code. Detecting these cases will require parallel performa...

Rashawn L. Knapp, Karen L. Karavanic, Douglas M. P...

claim paper

Read More »

24

click to vote

FOSSACS
2006
Springer

92views Software Engineering» more FOSSACS 2006»

Distributed Unfolding of Petri Nets

14 years 19 days ago

Download www.math.unipd.it

Some recent Petri net-based approaches to fault diagnosis of distributed systems suggest to factor the problem into local diagnoses based on the unfoldings of local views of the sy...

Paolo Baldan, Stefan Haar, Barbara König

claim paper

Read More »

26

click to vote

CDC
2008
IEEE

115views Control Systems» more CDC 2008»

Optimal sensor activation in controlled discrete event systems

14 years 3 months ago

Download www.eecs.umich.edu

— The problem of sensor activation in a controlled discrete event system is considered. Sensors are assumed to be costly and can be turned on/off during the operation of the syst...

Weilin Wang, Stéphane Lafortune, Feng Lin

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers