Search Sciweavers | Sciweavers

141 search results - page 19 / 29

» Load Execution Latency Reduction

131

Voted

ISCA
1994
IEEE

129views Hardware» more ISCA 1994»

Impact of Sharing-Based Thread Placement on Multithreaded Architectures

15 years 6 months ago

Download www.cs.sfu.ca

Multithreaded architectures context switch between instruction streams to hide memory access latency. Although this improves processor utilization, it can increase cache interfere...

Radhika Thekkath, Susan J. Eggers

claim paper

Read More »

109

Voted

ISCA
1997
IEEE

90views Hardware» more ISCA 1997»

The Interaction of Software Prefetching with ILP Processors in Shared-Memory Systems

15 years 6 months ago

Download www.hpl.hp.com

Current microprocessors aggressively exploit instructionlevel parallelism (ILP) through techniques such as multiple issue, dynamic scheduling, and non-blocking reads. Recent work ...

Parthasarathy Ranganathan, Vijay S. Pai, Hazim Abd...

claim paper

Read More »

125

Voted

ISCA
1996
IEEE

126views Hardware» more ISCA 1996»

Memory Bandwidth Limitations of Future Microprocessors

15 years 6 months ago

Download www.cs.utexas.edu

This paper makes the case that pin bandwidth will be a critical consideration for future microprocessors. We show that many of the techniques used to tolerate growing memory laten...

Doug Burger, James R. Goodman, Alain Kägi

claim paper

Read More »

124

Voted

HIPC
2009
Springer

136views Distributed And Parallel Com...» more HIPC 2009»

Distance-aware round-robin mapping for large NUCA caches

15 years 12 days ago

Download homepages.inf.ed.ac.uk

In many-core architectures, memory blocks are commonly assigned to the banks of a NUCA cache by following a physical mapping. This mapping assigns blocks to cache banks in a round-...

Alberto Ros, Marcelo Cintra, Manuel E. Acacio, Jos...

claim paper

Read More »

129

Voted

HPCA
2005
IEEE

104views Distributed And Parallel Com...» more HPCA 2005»

Microarchitectural Wire Management for Performance and Power in Partitioned Architectures

15 years 8 months ago

Download www.cs.utah.edu

Future high-performance billion-transistor processors are likely to employ partitioned architectures to achieve high clock speeds, high parallelism, low design complexity, and low...

Rajeev Balasubramonian, Naveen Muralimanohar, Kart...

claim paper

Read More »

« Prev « First page 19 / 29 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers