Search Sciweavers | Sciweavers

955 search results - page 8 / 191

» Performance optimization of multiple memory architectures fo...

132

Voted

TPDS
2010

144views more TPDS 2010»

Performance Evaluation of Dynamic Speculative Multithreading with the Cascadia Architecture

15 years 27 days ago

Download web.engr.oregonstate.edu

—Thread-level parallelism (TLP) has been extensively studied in order to overcome the limitations of exploiting instruction-level parallelism (ILP) on high-performance superscala...

David A. Zier, Ben Lee

claim paper

Read More »

117

Voted

ICPP
2003
IEEE

82views Distributed And Parallel Com...» more ICPP 2003»

Procedural Level Address Offset Assignment of DSP Applications with Loops

15 years 7 months ago

Download www.cs.pitt.edu

Automatic optimization of address offset assignment for DSP applications, which reduces the number of address arithmetic instructions to meet the tight memory size restrictions an...

Youtao Zhang, Jun Yang 0002

claim paper

Read More »

125

Voted

IPPS
1996
IEEE

86views Distributed And Parallel Com...» more IPPS 1996»

A Method for Register Allocation to Loops in Multiple Register File Architectures

15 years 6 months ago

Download ipdps.cc.gatech.edu

Multiple instruction issue processors place high demands on register file bandwidth. One solution to reduce this bottleneck is the use of multiple register files. Register allocat...

David J. Kolson, Alexandru Nicolau, Nikil D. Dutt,...

claim paper

Read More »

152

Voted

ICPP
2009
IEEE

170views Distributed And Parallel Com...» more ICPP 2009»

Perfomance Models for Blocked Sparse Matrix-Vector Multiplication Kernels

15 years 9 months ago

Download www.cslab.ece.ntua.gr

—Sparse Matrix-Vector multiplication (SpMV) is a very challenging computational kernel, since its performance depends greatly on both the input matrix and the underlying architec...

Vasileios Karakasis, Georgios I. Goumas, Nectarios...

claim paper

Read More »

124

click to vote

MICRO
2002
IEEE

173views Hardware» more MICRO 2002»

Vector vs. superscalar and VLIW architectures for embedded multimedia benchmarks

15 years 7 months ago

Download iram.cs.berkeley.edu

Multimedia processing on embedded devices requires an architecture that leads to high performance, low power consumption, reduced design complexity, and small code size. In this p...

Christoforos E. Kozyrakis, David A. Patterson

claim paper

Read More »

« Prev « First page 8 / 191 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers