
HPCA
2011
IEEE

Exploiting criticality to reduce bottlenecks in distributed uniprocessors

Composable multicore systems merge multiple independent cores for running sequential single-threaded workloads. The performance scalability of these systems, however, is limited due to partitioning overheads. This paper addresses two of the key performance scalability limitations of composable multicore systems. We present a critical path analysis revealing that communication needed for cross-core register value delivery and fetch stalls due to misspeculation are the two worst bottlenecks that prevent efficient scaling to a large number of fused cores. To alleviate these bottlenecks, this paper proposes a fully distributed framework to exploit criticality in these architectures at different granularities. A coordinator core exploits different types of block-level communication criticality information to fine-tune critical instructions at decode and register forward pipeline stages of their executing cores. The framework exploits the fetch criticality information at a coarser granula...
Added 20 Aug 2011
Updated 20 Aug 2011
Type Conference
Year 2011
Where HPCA
Authors Behnam Robatmili, Madhu Saravana Sibi Govindan, Doug Burger, Stephen W. Keckler