Search Sciweavers | Sciweavers

16 search results - page 3 / 4

» A Multi-Threaded Streaming Pipeline Architecture for Large S...

click to vote

HPCA
2009
IEEE

156views Distributed And Parallel Com...» more HPCA 2009»

Techniques for bandwidth-efficient prefetching of linked data structures in hybrid prefetching systems

14 years 8 months ago

Download www.ece.cmu.edu

Linked data structure (LDS) accesses are critical to the performance of many large scale applications. Techniques have been proposed to prefetch such accesses. Unfortunately, many...

Eiman Ebrahimi, Onur Mutlu, Yale N. Patt

claim paper

Read More »

click to vote

CASES
2006
ACM

164views System Software» more CASES 2006»

Improving the performance and power efficiency of shared helpers in CMPs

13 years 11 months ago

Download www.cs.york.ac.uk

Technology scaling trends have forced designers to consider alternatives to deeply pipelining aggressive cores with large amounts of performance accelerating hardware. One alterna...

Anahita Shayesteh, Glenn Reinman, Norman P. Jouppi...

claim paper

Read More »

click to vote

PARA
1995
Springer

174views Applied Computing» more PARA 1995»

A Proposal for a Set of Parallel Basic Linear Algebra Subprograms

13 years 11 months ago

Download phase.hpcc.jp

This paper describes a proposal for a set of Parallel Basic Linear Algebra Subprograms PBLAS. The PBLAS are targeted at distributed vector-vector, matrix-vector and matrixmatrix...

Jaeyoung Choi, Jack Dongarra, Susan Ostrouchov, An...

claim paper

Read More »

click to vote

ISCA
1999
IEEE

110views Hardware» more ISCA 1999»

Decoupling Local Variable Accesses in a Wide-Issue Superscalar Processor

13 years 11 months ago

Download www.cs.pitt.edu

Providing adequate data bandwidth is extremely important for a wide-issue superscalar processor to achieve its full performance potential. Adding a large number of ports to a data...

Sangyeun Cho, Pen-Chung Yew, Gyungho Lee

claim paper

Read More »

click to vote

EUROGRAPHICS
2010
Eurographics

356views Computer Graphics» more EUROGRAPHICS 2010»

Fast Ray Sorting and Breadth-First Packet Traversal for GPU Ray Tracing

14 years 3 months ago

Download research.microsoft.com

We present a novel approach to ray tracing execution on commodity graphics hardware using CUDA. We decompose a standard ray tracing algorithm into several data-parallel stages tha...

Kirill Garanzha and Charles Loop

claim paper

Read More »

« Prev « First page 3 / 4 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers