Sciweavers

2400 search results - page 172 / 480
» Systems Failures
Sort
View
IEEEHPCS
2010
13 years 7 months ago
Resilient workflows for high-performance simulation platforms
Workflows systems are considered here to support largescale multiphysics simulations. Because the use of large distributed and parallel multi-core infrastructures is prone to soft...
Toan Nguyen, Laurentiu Trifan, Jean-Antoine Deside...
MICRO
2009
IEEE
128views Hardware» more  MICRO 2009»
14 years 3 months ago
mSWAT: low-cost hardware fault detection and diagnosis for multicore systems
Continued technology scaling is resulting in systems with billions of devices. Unfortunately, these devices are prone to failures from various sources, resulting in even commodity...
Siva Kumar Sastry Hari, Man-Lap Li, Pradeep Ramach...
DSN
2004
IEEE
14 years 29 days ago
Assured Reconfiguration of Embedded Real-Time Software
It is often the case that safety-critical systems have to be reconfigured during operation because of issues such as changes in the system's operating environment or the fail...
Elisabeth A. Strunk, John C. Knight
USENIX
2008
13 years 11 months ago
Using Causality to Diagnose Configuration Bugs
We present a novel method for diagnosing configuration management errors. Our proposed approach deduces the state of a buggy computer by running predicates that test system correc...
Mona Attariyan, Jason Flinn
JITECH
2007
220views more  JITECH 2007»
13 years 9 months ago
Fixing the payment system at Alvalade XXI: a case on IT project risk management
This case describes the implementation and subsequent failure of an innovative system installed in the bars of Alvalade XXI, the recently built football stadium in Lisbon, Portuga...
Ramon O'Callaghan