Search Sciweavers | Sciweavers

62 search results - page 3 / 13

» Adaptive memory programming for matrix bandwidth minimizatio...

164

click to vote

ASAP
2004
IEEE

119views Hardware» more ASAP 2004»

Automatic Synthesis of Customized Local Memories for Multicluster Application Accelerators

15 years 10 months ago

Download cccp.eecs.umich.edu

Distributed local memories, or scratchpads, have been shown to effectively reduce cost and power consumption of application-specific accelerators while maintaining performance. Th...

Manjunath Kudlur, Kevin Fan, Michael L. Chu, Scott...

claim paper

Read More »

196

click to vote

IEEEHPCS
2010

165views Applied Computing» more IEEEHPCS 2010»

Reducing memory requirements of stream programs by graph transformations

15 years 4 months ago

Download www.sifflez.org

Stream languages explicitly describe fork-join parallelism and pipelines, offering a powerful programming model for many-core Multi-Processor Systems on Chip (MPSoC). In an embedd...

Pablo de Oliveira Castro, Stéphane Louise, ...

claim paper

Read More »

205

Voted

EUROPAR
2010
Springer

189views Distributed And Parallel Com...» more EUROPAR 2010»

Optimized Dense Matrix Multiplication on a Many-Core Architecture

15 years 7 months ago

Download www.capsl.udel.edu

Abstract. Traditional parallel programming methodologies for improving performance assume cache-based parallel systems. However, new architectures, like the IBM Cyclops-64 (C64), b...

Elkin Garcia, Ioannis E. Venetis, Rishi Khan, Guan...

claim paper

Read More »

178

click to vote

BIBE
2007
IEEE

150views Bioinformatics» more BIBE 2007»

Differential Scoring for Systolic Sequence Alignment

16 years 1 months ago

Download www.simile.ca

Systolic implementations of dynamic programming solutions that utilize a similarity matrix can achieve appreciable performance with both course- and fine-grain parallelization. A ...

Antonio E. de la Serna

claim paper

Read More »

183

click to vote

SIAMSC
2010

120views more SIAMSC 2010»

Weighted Matrix Ordering and Parallel Banded Preconditioners for Iterative Linear System Solvers

15 years 5 months ago

Download www.cs.purdue.edu

The emergence of multicore architectures and highly scalable platforms motivates the development of novel algorithms and techniques that emphasize concurrency and are tolerant of ...

Murat Manguoglu, Mehmet Koyutürk, Ahmed H. Sa...

claim paper

Read More »

« Prev « First page 3 / 13 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers