Search Sciweavers | Sciweavers

» Parallel Programming with Transactional Memory

EUROPAR
2010
Springer

189 views, Distributed And Parallel Com...

Optimized Dense Matrix Multiplication on a Many-Core Architecture

15 years 3 months ago

Download www.capsl.udel.edu

Abstract. Traditional parallel programming methodologies for improving performance assume cache-based parallel systems. However, new architectures, like the IBM Cyclops-64 (C64), b...

Elkin Garcia, Ioannis E. Venetis, Rishi Khan, Guan...

IPPS
2007
IEEE

143 views, Distributed And Parallel Com...

Optimizing Inter-Nest Data Locality Using Loop Splitting and Reordering

15 years 8 months ago

Download www.cecs.uci.edu

With the increasing gap between processor speed and memory latency, the performance of data-dominated programs is becoming more reliant on fast data access, which can be improved...

Sofiane Naci

PVM
2009
Springer

114 views, Distributed And Parallel Com...

MPI on a Million Processors

15 years 8 months ago

Download www.mcs.anl.gov

Petascale machines with close to a million processors will soon be available. Although MPI is the dominant programming model today, some researchers and users wonder (and perhaps e...

Pavan Balaji, Darius Buntinas, David Goodell, Will...

ISNN
2005
Springer

134 views, Neural Networks

A SIMD Neural Network Processor for Image Processing

15 years 7 months ago

Download cad.yonsei.ac.kr

Abstract. Artificial Neural Networks (ANNs) and image processing require massively parallel computation of simple operators accompanied by heavy memory access. Thus, this type of ...

Dongsun Kim, Hyunsik Kim, Hongsik Kim, Gunhee Han,...

ICPP
2003
IEEE

82 views, Distributed And Parallel Com...

Procedural Level Address Offset Assignment of DSP Applications with Loops

15 years 7 months ago

Download www.cs.pitt.edu

Automatic optimization of address offset assignment for DSP applications, which reduces the number of address arithmetic instructions to meet the tight memory size restrictions an...

Youtao Zhang, Jun Yang 0002
