HPCA 2009 | Sciweavers

HPCA
2009
IEEE

In-Network Snoop Ordering (INSO): Snoopy coherence on unordered interconnects

16 years 5 months ago

Realizing scalable cache coherence in the many-core era comes with a whole new set of constraints and opportunities. It is widely believed that multi-hop, unordered on-chip networ...

Niket Agarwal, Li-Shiuan Peh, Niraj K. Jha

Techniques for bandwidth-efficient prefetching of linked data structures in hybrid prefetching systems

16 years 5 months ago

Linked data structure (LDS) accesses are critical to the performance of many large scale applications. Techniques have been proposed to prefetch such accesses. Unfortunately, many...

Eiman Ebrahimi, Onur Mutlu, Yale N. Patt

Design and implementation of software-managed caches for multicores with local memory

16 years 5 months ago

Heterogeneous multicores, such as Cell BE processors and GPGPUs, typically do not have caches for their accelerator cores because coherence traffic, cache misses, and latencies fr...

Sangmin Seo, Jaejin Lee, Zehra Sura

Practical off-chip meta-data for temporal memory streaming

16 years 5 months ago

Prior research demonstrates that temporal memory streaming and related address-correlating prefetchers improve performance of commercial server workloads through increased memory l...

Thomas F. Wenisch, Michael Ferdman, Anastasia Aila...

Bridging the computation gap between programmable processors and hardwired accelerators

16 years 5 months ago

New media and signal processing applications demand ever higher performance while operating within the tight power constraints of mobile devices. A range of hardware implementatio...

Kevin Fan, Manjunath Kudlur, Ganesh S. Dasika, Sco...

A low-radix and low-diameter 3D interconnection network design

16 years 5 months ago

Interconnection plays an important role in the performance and power of CMP designs using deep sub-micron technology. Networks-on-chip (NoCs) have been proposed as a scalable and hi...

Bo Zhao, Jun Yang, Xiuyi Zhou, Yi Xu, Youtao ...

Fast complete memory consistency verification

16 years 5 months ago

The verification of an execution against memory consistency is known to be NP-hard. This paper proposes a novel fast memory consistency verification method by identifying a new na...

Yunji Chen, Yi Lv, Weiwu Hu, Tianshi Chen, Haihua ...

Lightweight predication support for out of order processors

16 years 5 months ago

The benefits of Out of Order (OOO) processing are well known, as is the effectiveness of predicated execution for unpredictable control flow. However, as previous research has dem...

Mark Stephenson, Lixin Zhang, Ram Rangan

Variation-aware dynamic voltage/frequency scaling

16 years 5 months ago

Fine-grained dynamic voltage/frequency scaling (DVFS) is an important tool in managing the balance between power and performance in chip-multiprocessors. Although manufacturing pr...

Sebastian Herbert, Diana Marculescu

PageNUCA: Selected policies for page-grain locality management in large shared chip-multiprocessor caches

16 years 5 months ago

As the last-level on-chip caches in chip-multiprocessors increase in size, the physical locality of on-chip data becomes important for delivering high performance. The non-uniform...

Mainak Chaudhuri
