Sciweavers

615 search results - page 10 / 123
» An Architecture for Supporting Network Fault Recovery Manage...
Sort
View
DAC
2011
ACM
12 years 7 months ago
Generalized reliability-oriented energy management for real-time embedded applications
DVFS remains an important energy management technique for embedded systems. However, its negative impact on transient fault rates has been recently shown. In this paper, we propos...
Baoxian Zhao, Hakan Aydin, Dakai Zhu
ISCA
2002
IEEE
115views Hardware» more  ISCA 2002»
14 years 11 days ago
SafetyNet: Improving the Availability of Shared Memory Multiprocessors with Global Checkpoint/Recovery
We develop an availability solution, called SafetyNet, that uses a unified, lightweight checkpoint/recovery mechanism to support multiple long-latency fault detection schemes. At...
Daniel J. Sorin, Milo M. K. Martin, Mark D. Hill, ...
ATC
2007
Springer
14 years 1 months ago
An Architecture for Self-healing Autonomous Object Groups
Abstract. Jgroup/ARM is a middleware for developing and operating dependable distributed Java applications. Jgroup integrates the distributed object model of Java RMI with the obje...
Hein Meling
DSN
2008
IEEE
14 years 1 months ago
A characterization of instruction-level error derating and its implications for error detection
In this work, we characterize a significant source of software derating that we call instruction-level derating. Instruction-level derating encompasses the mechanisms by which co...
Jeffrey J. Cook, Craig B. Zilles
JCP
2006
115views more  JCP 2006»
13 years 7 months ago
Fault Tolerance in a Multi-Layered DRE System: A Case Study
Dynamic resource management is a crucial part of the infrastructure for emerging distributed real-time embedded systems, responsible for keeping mission-critical applications opera...
Paul Rubel, Joseph P. Loyall, Richard E. Schantz, ...