Sciweavers

DATE
2009
IEEE

Fault-tolerant average execution time optimization for general-purpose multi-processor system-on-chips

Because of developments in semiconductor technology, fault tolerance is important not only for safety-critical systems but also for general-purpose (non-safety-critical) systems. For general-purpose systems, however, the goal is not to guarantee that deadlines are always met but to minimize the average execution time (AET) while ensuring fault tolerance. For a given job and a soft (transient) error probability, we define mathematical formulas for AET that include bus communication overhead for both voting (active replication) and rollback-recovery with checkpointing (RRC). For a given multi-processor system-on-chip (MPSoC), we also define integer linear programming (ILP) models that minimize AET, including bus communication overhead, when: (1) selecting the number of checkpoints when using RRC, (2) finding the number of processors and the job-to-processor assignment when using voting, and (3) defining the fault-tolerance scheme (voting or RRC) per job and defining its usage for each job. E...
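To illustrate the trade-off behind item (1), the following is a minimal sketch of how AET under RRC can depend on the number of checkpoints. It is not the formula from the paper, which is not reproduced in this abstract. It assumes a simplified model: the job is split into equal segments, errors are independent, each segment is re-executed until it completes without error, and a single hypothetical `overhead` parameter stands in for all per-checkpoint costs, including bus communication. The names `aet_rrc` and `best_checkpoints` are illustrative only.

```python
def aet_rrc(exec_time, p_ok, n_checkpoints, overhead):
    """Expected execution time under a simplified RRC model (an assumption,
    not the paper's formula).

    exec_time     -- fault-free execution time of the whole job
    p_ok          -- probability that the whole job runs without a soft error
    n_checkpoints -- number of equal segments, each ending in a checkpoint
    overhead      -- time cost per checkpoint (hypothetical lumped parameter)
    """
    if n_checkpoints < 1:
        raise ValueError("n_checkpoints must be at least 1")
    if not 0.0 < p_ok <= 1.0:
        raise ValueError("p_ok must be in (0, 1]")
    # Time for one attempt at one segment, including its checkpoint.
    segment_time = exec_time / n_checkpoints + overhead
    # With independent errors, a segment of length T/n succeeds with p_ok**(1/n).
    p_segment = p_ok ** (1.0 / n_checkpoints)
    # Retries until success are geometric: expected attempts = 1 / p_segment.
    return n_checkpoints * segment_time / p_segment


def best_checkpoints(exec_time, p_ok, overhead, n_max=100):
    """Brute-force search for the checkpoint count with the lowest AET."""
    return min(range(1, n_max + 1),
               key=lambda n: aet_rrc(exec_time, p_ok, n, overhead))
```

In this sketch, more checkpoints shorten the work lost per error but add overhead, so the AET has a minimum at an intermediate checkpoint count whenever errors are likely enough. An ILP formulation such as the one the abstract describes would replace the brute-force search.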
Mikael Väyrynen, Virendra Singh, Erik Larsson
Added 16 Aug 2010
Updated 16 Aug 2010
Type Conference
Year 2009
Where DATE
Authors Mikael Väyrynen, Virendra Singh, Erik Larsson