Search Sciweavers | Sciweavers

3379 search results - page 229 / 676

» Parallel cross-entropy optimization

149

click to vote

EUROPAR
2010
Springer

146views Distributed And Parallel Com...» more EUROPAR 2010»

Optimized On-Chip-Pipelined Mergesort on the Cell/B.E

15 years 2 months ago

Download www.ida.liu.se

Abstract. Limited bandwidth to off-chip main memory is a performance bottleneck in chip multiprocessors for streaming computations, such as Cell/B.E., and this will become even mor...

Rikard Hultén, Christoph W. Kessler, Jö...

claim paper

Read More »

159

Voted

PPOPP
2009
ACM

358views Distributed and Parallel Com...» more PPOPP 2009»

OpenMP to GPGPU: a compiler framework for automatic translation and optimization

16 years 3 months ago

Download www.multicoreinfo.com

GPGPUs have recently emerged as powerful vehicles for generalpurpose high-performance computing. Although a new Compute Unified Device Architecture (CUDA) programming model from N...

Seyong Lee, Seung-Jai Min, Rudolf Eigenmann

claim paper

Read More »

105

click to vote

CGO
2003
IEEE

87views Software Engineering» more CGO 2003»

Optimizing Memory Accesses For Spatial Computation

15 years 7 months ago

Download www-2.cs.cmu.edu

In this paper we present the internal representation and optimizations used by the CASH compiler for improving the memory parallelism of pointer-based programs. CASH uses an SSA-b...

Mihai Budiu, Seth Copen Goldstein

claim paper

Read More »

150

click to vote

ASPLOS
2008
ACM

186views Programming Languages» more ASPLOS 2008»

Communication optimizations for global multi-threaded instruction scheduling

15 years 4 months ago

Download liberty.princeton.edu

The recent shift in the industry towards chip multiprocessor (CMP) designs has brought the need for multi-threaded applications to mainstream computing. As observed in several lim...

Guilherme Ottoni, David I. August

claim paper

Read More »

108

click to vote

EUROPAR
2009
Springer

122views Distributed And Parallel Com...» more EUROPAR 2009»

A Case Study of Communication Optimizations on 3D Mesh Interconnects

15 years 9 months ago

Download charm.cs.uiuc.edu

Optimal network performance is critical to eﬃcient parallel scaling for communication-bound applications on large machines. With wormhole routing, no-load latencies do not increa...

Abhinav Bhatele, Eric J. Bohm, Laxmikant V. Kal&ea...

claim paper

Read More »

« Prev « First page 229 / 676 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers