Search Sciweavers | Sciweavers

135 search results - page 15 / 27

» Code and Data Transformations for Improving Shared Cache Per...

114

click to vote

MICRO
2006
IEEE

145views Hardware» more MICRO 2006»

ASR: Adaptive Selective Replication for CMP Caches

15 years 10 months ago

Download www.cs.wisc.edu

The large working sets of commercial and scientiﬁc workloads stress the L2 caches of Chip Multiprocessors (CMPs). Some CMPs use a shared L2 cache to maximize the on-chip cache c...

Bradford M. Beckmann, Michael R. Marty, David A. W...

claim paper

Read More »

131

click to vote

ICS
2009
Tsinghua U.

152views Distributed And Parallel Com...» more ICS 2009»

Computer generation of fast fourier transforms for the cell broadband engine

15 years 10 months ago

Download spiral.ece.cmu.edu

The Cell BE is a multicore processor with eight vector accelerators (called SPEs) that implement explicit cache management through direct memory access engines. While the Cell has...

Srinivas Chellappa, Franz Franchetti, Markus P&uum...

claim paper

Read More »

129

click to vote

CAINE
2006

102views Computer Science» more CAINE 2006»

A multiobjective evolutionary approach for constrained joint source code optimization

15 years 5 months ago

Download publik.tuwien.ac.at

The synergy of software and hardware leads to efficient application expression profile (AEP) not only in terms of execution time and energy but also optimal architecture usage. We...

Naeem Zafar Azeemi

claim paper

Read More »

154

click to vote

ICPP
1999
IEEE

114views Distributed And Parallel Com...» more ICPP 1999»

A Framework for Interprocedural Locality Optimization Using Both Loop and Data Layout Transformations

15 years 8 months ago

Download cucis.ece.northwestern.edu

There has been much work recently on improving the locality performance of loop nests in scientific programs through the use of loop as well as data layout optimizations. However,...

Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanuja...

claim paper

Read More »

223

click to vote

IWMM
2011
Springer

270views Hardware» more IWMM 2011»

Memory management in NUMA multicore systems: trapped between cache contention and interconnect overhead

14 years 6 months ago

Download people.inf.ethz.ch

Multiprocessors based on processors with multiple cores usually include a non-uniform memory architecture (NUMA); even current 2-processor systems with 8 cores exhibit non-uniform...

Zoltan Majo, Thomas R. Gross

claim paper

Read More »

« Prev « First page 15 / 27 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers