Search Sciweavers | Sciweavers

20

MICRO
2010
IEEE

153views Hardware» more MICRO 2010»

Throughput-Effective On-Chip Networks for Manycore Accelerators

13 years 5 months ago

As the number of cores and threads in manycore compute accelerators such as Graphics Processing Units (GPU) increases, so does the importance of on-chip interconnection network des...

Ali Bakhoda, John Kim, Tor M. Aamodt

claim paper

Read More »

25

click to vote

PLDI
1995
ACM

122views Programming Languages» more PLDI 1995»

Improving Balanced Scheduling with Compiler Optimizations that Increase Instruction-Level Parallelism

13 years 11 months ago

Download reference.kfupm.edu.sa

Traditional list schedulers order instructions based on an optimistic estimate of the load latency imposed by the hardware and therefore cannot respond to variations in memory lat...

Jack L. Lo, Susan J. Eggers

claim paper

Read More »

24

click to vote

PPOPP
2010
ACM

216views Distributed and Parallel Com...» more PPOPP 2010»

Structure-driven optimizations for amorphous data-parallel programs

14 years 4 months ago

Download users.ices.utexas.edu

Irregular algorithms are organized around pointer-based data structures such as graphs and trees, and they are ubiquitous in applications. Recent work by the Galois project has pr...

Mario Méndez-Lojo, Donald Nguyen, Dimitrios...

claim paper

Read More »

27

click to vote

ICS
2001
Tsinghua U.

142views Distributed And Parallel Com...» more ICS 2001»

Global optimization techniques for automatic parallelization of hybrid applications

14 years 2 days ago

Download www.ensta.fr

This paper presents a novel technique to perform global optimization of communication and preprocessing calls in the presence of array accesses with arbitrary subscripts. Our sche...

Dhruva R. Chakrabarti, Prithviraj Banerjee

claim paper

Read More »

22

click to vote

ICPP
1998
IEEE

135views Distributed And Parallel Com...» more ICPP 1998»

Concurrent SSA Form in the Presence of Mutual Exclusion

13 years 12 months ago

Download www.airs.com

Most current compiler analysis techniques are unable to cope with the semantics introduced by explicit parallel and synchronization constructs in parallel programs. In this paper ...

Diego Novillo, Ronald C. Unrau, Jonathan Schaeffer

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers