Search Sciweavers | Sciweavers

1001 search results - page 43 / 201

» Improving memory hierarchy performance for irregular applica...

216

click to vote

PPOPP
2009
ACM

358views Distributed and Parallel Com...» more PPOPP 2009»

OpenMP to GPGPU: a compiler framework for automatic translation and optimization

16 years 6 months ago

Download www.multicoreinfo.com

GPGPUs have recently emerged as powerful vehicles for generalpurpose high-performance computing. Although a new Compute Unified Device Architecture (CUDA) programming model from N...

Seyong Lee, Seung-Jai Min, Rudolf Eigenmann

claim paper

Read More »

169

click to vote

PPOPP
2010
ACM

191views Distributed and Parallel Com...» more PPOPP 2010»

Scalable communication protocols for dynamic sparse data exchange

16 years 3 months ago

Download www.unixer.de

Many large-scale parallel programs follow a bulk synchronous parallel (BSP) structure with distinct computation and communication phases. Although the communication phase in such ...

Torsten Hoefler, Christian Siebert, Andrew Lumsdai...

claim paper

Read More »

154

click to vote

ICS
1998
Tsinghua U.

95views Distributed And Parallel Com...» more ICS 1998»

Data Prefetching for Software DSMs

15 years 10 months ago

Download www.cos.ufrj.br

In this paper we propose and evaluate the Adaptive++ technique, a novel runtime-only data prefetching strategy for software-based distributed shared-memory systems (software DSMs)...

Ricardo Bianchini, Raquel Pinto, Claudio Luis de A...

claim paper

Read More »

142

click to vote

MICRO
2006
IEEE

79views Hardware» more MICRO 2006»

Fair Queuing Memory Systems

16 years 3 days ago

Download pages.cs.wisc.edu

We propose and evaluate a multi-thread memory scheduler that targets high performance CMPs. The proposed memory scheduler is based on concepts originally developed for network fai...

Kyle J. Nesbit, Nidhi Aggarwal, James Laudon, Jame...

claim paper

Read More »

159

click to vote

DAC
2001
ACM

122views Computer Architecture» more DAC 2001»

Performance-Driven Multi-Level Clustering with Application to Hierarchical FPGA Mapping

16 years 7 months ago

Download www.gigascale.org

In this paper, we study the problem of performance-driven multi-level circuit clustering with application to hierarchical FPGA designs. We first show that the performance-driven m...

Jason Cong, Michail Romesis

claim paper

Read More »

« Prev « First page 43 / 201 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers