Search Sciweavers | Sciweavers

720 search results - page 98 / 144

» Uniform Memory Hierarchies

130

click to vote

SC
2004
ACM

98views Applied Computing» more SC 2004»

Big Wins with Small Application-Aware Caches

15 years 10 months ago

Download www.cs.cmu.edu

Large datasets, on the order of GB and TB, are increasingly common as abundant computational resources allow practitioners to collect, produce and store data at higher rates. As d...

Julio C. López, David R. O'Hallaron, Tianka...

claim paper

Read More »

153

click to vote

PARA
2004
Springer

230views Applied Computing» more PARA 2004»

A Family of High-Performance Matrix Multiplication Algorithms

15 years 9 months ago

Download userweb.cs.utexas.edu

During the last half-decade, a number of research eﬀorts have centered around developing software for generating automatically tuned matrix multiplication kernels. These include ...

John A. Gunnels, Fred G. Gustavson, Greg Henry, Ro...

claim paper

Read More »

127

click to vote

ICPP
2003
IEEE

104views Distributed And Parallel Com...» more ICPP 2003»

Scheduling Algorithms with Bus Bandwidth Considerations for SMPs

15 years 9 months ago

Download people.cs.vt.edu

The bus that connects processors to memory is known to be a major architectural bottleneck in SMPs. However, both software and scheduling policies for these systems generally focu...

Christos D. Antonopoulos, Dimitrios S. Nikolopoulo...

claim paper

Read More »

131

click to vote

HPCA
2002
IEEE

100views Distributed And Parallel Com...» more HPCA 2002»

Non-Vital Loads

15 years 9 months ago

Download www.hpcaconf.org

As the frequency gap between main memory and modern microprocessor grows, the implementation and efficiency of on-chip caches become more important. The growing latency to memory ...

Ryan Rakvic, Bryan Black, Deepak Limaye, John Paul...

claim paper

Read More »

149

click to vote

ISSS
2002
IEEE

151views Hardware» more ISSS 2002»

Tuning of Loop Cache Architectures to Programs in Embedded System Design

15 years 9 months ago

Download www.cecs.uci.edu

Adding a small loop cache to a microprocessor has been shown to reduce average instruction fetch energy for various sets of embedded system applications. With the advent of core-b...

Frank Vahid, Susan Cotterell

claim paper

Read More »

« Prev « First page 98 / 144 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers