Sciweavers

392 search results - page 44 / 79
» Fault Tolerance in a DSM Cluster Operating System
Sort
View
ISCAS
2005
IEEE
129views Hardware» more  ISCAS 2005»
14 years 2 months ago
An analytical approach for soft error rate estimation in digital circuits
—Soft errors due to cosmic rays cause reliability problems during lifetime operation of digital systems, which increase exponentially with Moore’s law. The first step in develo...
Ghazanfar Asadi, Mehdi Baradaran Tahoori
EDCC
2008
Springer
13 years 10 months ago
A Distributed Approach to Autonomous Fault Treatment in Spread
This paper presents the design and implementation of the Distributed Autonomous Replication Management (DARM) framework built on top of the Spread group communication system. The ...
Hein Meling, Joakim L. Gilje
PDPTA
2003
13 years 10 months ago
Subway: Peer-to-Peer Clustering Of Clients for Web Proxy
Many cooperated web cache systems and protocols have been proposed. But, these systems need the expensive resources, such as core-link bandwidth and proxy cpu or storage, and need...
Kyungbaek Kim, Daeyeon Park
SRDS
1998
IEEE
14 years 27 days ago
System-Level Versus User-Defined Checkpointing
Checkpointing and rollback recovery is a very effective technique to tolerate transient faults and preventive shutdowns. In the past, most of the checkpointing schemes published i...
Luís Moura Silva, João Gabriel Silva
ICCD
2006
IEEE
113views Hardware» more  ICCD 2006»
14 years 5 months ago
A theory of Error-Rate Testing
— We have entered an era where chip yields are decreasing with scaling. A new concept called intelligible testing has been previously proposed with the goal of reversing this tre...
Shideh Shahidi, Sandeep Gupta