Search Sciweavers | Sciweavers

309 search results - page 22 / 62

» Parallel Memory Architecture for Arbitrary Stride Accesses

216

click to vote

IJPP
2010

156views more IJPP 2010»

ForestGOMP: An Efficient OpenMP Environment for NUMA Architectures

15 years 4 months ago

Download hal.inria.fr

Exploiting the full computational power of current hierarchical multiprocessor machines requires a very careful distribution of threads and data among the underlying non-uniform ar...

François Broquedis, Nathalie Furmento, Bric...

claim paper

Read More »

204

Voted

ICS
2005
Tsinghua U.

108views Distributed And Parallel Com...» more ICS 2005»

A heterogeneously segmented cache architecture for a packet forwarding engine

16 years 28 days ago

Download hpc.serc.iisc.ernet.in

As network trafﬁc continues to increase and with the requirement to process packets at line rates, high performance routers need to forward millions of packets every second. Eve...

Kaushik Rajan, Ramaswamy Govindarajan

claim paper

Read More »

220

click to vote

SPAA
1995
ACM

142views Distributed And Parallel Com...» more SPAA 1995»

Accounting for Memory Bank Contention and Delay in High-Bandwidth Multiprocessors

15 years 11 months ago

Download theory.stanford.edu

For years, the computation rate of processors has been much faster than the access rate of memory banks, and this divergence in speeds has been constantly increasing in recent yea...

Guy E. Blelloch, Phillip B. Gibbons, Yossi Matias,...

claim paper

Read More »

189

click to vote

IPPS
2009
IEEE

168views Distributed And Parallel Com...» more IPPS 2009»

Exploiting DMA to enable non-blocking execution in Decoupled Threaded Architecture

16 years 2 months ago

Download www.dii.unisi.it

DTA (Decoupled Threaded Architecture) is designed to exploit ﬁne/medium grained Thread Level Parallelism (TLP) by using a distributed hardware scheduling unit and relying on exi...

Roberto Giorgi, Zdravko Popovic, Nikola Puzovic

claim paper

Read More »

192

click to vote

IEEEPACT
2002
IEEE

97views Distributed And Parallel Com...» more IEEEPACT 2002»

Compiler-Controlled Caching in Superword Register Files for Multimedia Extension Architectures

16 years 10 days ago

Download www.mcs.anl.gov

In this paper, we describe an algorithm and implementation of locality optimizations for architectures with instruction sets such as Intel’s SSE and Motorola’s AltiVec that su...

Jaewook Shin, Jacqueline Chame, Mary W. Hall

claim paper

Read More »

« Prev « First page 22 / 62 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers