Sciweavers

IPPS
1994
IEEE
13 years 11 months ago
Toward Flexible Control of the Temporal Mapping from Concurrent Program Events to Animations
As parallel and distributed computers become more widely available and used, the already important process of understanding and debugging concurrent programs will take on even gre...
Eileen Kraemer, John T. Stasko
CDC
2009
IEEE
14 years 8 days ago
On observer based stabilization of networked systems
Stabilizability of linear time invariant networked systems of general structure is studied with an observer-based approach. In the assumption of piecewise constant controls an ...
Paolo Caravani, Elena De Santis
ICDCS
2010
IEEE
13 years 11 months ago
Visual, Log-Based Causal Tracing for Performance Debugging of MapReduce Systems
The distributed nature and large scale of MapReduce programs and systems pose two challenges in using existing profiling and debugging tools to understand MapReduce pr...
Jiaqi Tan, Soila Kavulya, Rajeev Gandhi, Priya Nar...
DSN
2006
IEEE
14 years 1 months ago
Automatic Recovery Using Bounded Partially Observable Markov Decision Processes
This paper provides a technique, based on partially observable Markov decision processes (POMDPs), for building automatic recovery controllers to guide distributed system recovery...
Kaustubh R. Joshi, William H. Sanders, Matti A. Hi...
IPPS
2005
IEEE
14 years 1 months ago
Monitoring and Debugging Parallel Software with BCS-MPI on Large-Scale Clusters
Buffered CoScheduled (BCS) MPI is a novel implementation of MPI based on global synchronization of all system activities. BCS-MPI imposes a model where all processes and their com...
Juan Fernández, Fabrizio Petrini, Eitan Fra...