Transient fault prediction based on anomalies in processor events


Future microprocessors will be highly susceptible to transient errors as transistor sizes decrease due to CMOS scaling. Prior techniques advocated full-scale structural or temporal redundancy to achieve fault tolerance. Though they can provide complete fault coverage, they incur significant hardware and/or performance cost. It is desirable to have mechanisms that can provide partial but sufficiently high fault coverage with negligible cost. To meet this goal, we propose leveraging speculative structures that already exist in modern processors. The proposed mechanism is based on the insight that when a fault occurs, the incorrect execution is likely to produce an abnormally high or low number of mispredictions (branch mispredictions, L2 misses, store set mispredictions) compared with a correct execution. We design a simple transient fault predictor that detects anomalous behavior in the outcomes of the speculative structures to predict transient faults.
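The insight above can be illustrated with a minimal software sketch. It is not the predictor designed in the paper, whose details are not given here. The class name `EventAnomalyDetector`, the sliding window, the standard-deviation threshold, and the sample counts are all assumptions chosen for illustration. The sketch counts one kind of event (for example, branch mispredictions) per fixed-length interval and flags an interval whose count is abnormally high or low relative to recent history.

```python
from collections import deque
from statistics import mean, pstdev


class EventAnomalyDetector:
    """Flags intervals whose event count strays far from recent history.

    Hypothetical illustration only: the window size, threshold, and
    deviation test are assumptions, not the predictor from the paper.
    """

    def __init__(self, window=8, threshold=3.0):
        self.history = deque(maxlen=window)  # recent per-interval counts
        self.window = window
        self.threshold = threshold  # allowed deviation, in standard deviations

    def observe(self, count):
        """Return True if `count` is abnormally high or low."""
        anomalous = False
        if len(self.history) == self.window:
            mu = mean(self.history)
            # Floor sigma at 1.0 so a flat history cannot cause division by zero.
            sigma = max(pstdev(self.history), 1.0)
            anomalous = abs(count - mu) / sigma > self.threshold
        if not anomalous:
            # Only learn from intervals that looked normal.
            self.history.append(count)
        return anomalous


# Made-up misprediction counts per interval; 48 and 0 are the outliers.
detector = EventAnomalyDetector(window=4, threshold=3.0)
counts = [10, 12, 11, 9, 10, 48, 11, 0]
flags = [detector.observe(c) for c in counts]
print(flags)
```

In this sketch the first four intervals only fill the history, so nothing is flagged until a baseline exists. Both the spike (48) and the drop (0) are flagged, matching the abstract's point that a fault may push the count in either direction.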

Satish Narayanasamy, Ayse Kivilcim Coskun, Brad Calder


Complete Fault Coverage | DATE 2007 | Fault Coverage | Hardware | Transient Faults |


» Analysis of checksum-based execution schemes for pipelined processors

» Configurable isolation: building high availability systems with commodity multicore process...

» SlicK: slice-based locality exploitation for efficient redundant multithreading

Post Info

Added	02 Jun 2010
Updated	02 Jun 2010
Type	Conference
Year	2007
Where	DATE
Authors	Satish Narayanasamy, Ayse Kivilcim Coskun, Brad Calder
