Search Sciweavers | Sciweavers

656 search results - page 55 / 132

» Scalable Parallel Matrix Multiplication on Distributed Memor...

PPOPP
2010
ACM

353 views, Distributed and Parallel Com...

Data transformations enabling loop vectorization on multithreaded data parallel architectures

15 years 11 months ago

Download www.ece.neu.edu

Loop vectorization, a key feature exploited to obtain high performance on Single Instruction Multiple Data (SIMD) vector architectures, is significantly hindered by irregular memo...

Byunghyun Jang, Perhaad Mistry, Dana Schaa, Rodrig...

SIAMSC
2008

129 views

Bottom-Up Construction and 2:1 Balance Refinement of Linear Octrees in Parallel

15 years 2 months ago

Download www.seas.upenn.edu

Abstract. In this article, we propose new parallel algorithms for the construction and 2:1 balance refinement of large linear octrees on distributed memory machines. Such octrees a...

Hari Sundar, Rahul S. Sampath, George Biros

IPPS
2010
IEEE

148 views, Distributed And Parallel Com...

Parallelization of tau-leap coarse-grained Monte Carlo simulations on GPUs

15 years 3 days ago

Download gcl.cis.udel.edu

The Coarse-Grained Monte Carlo (CGMC) method is a multi-scale stochastic mathematical and simulation framework for spatially distributed systems. CGMC simulations are important too...

Lifan Xu, Michela Taufer, Stuart Collins, Dionisio...

ICS
2009
Tsinghua U.

143 views, Distributed And Parallel Com...

Fast and scalable list ranking on the GPU

15 years 9 months ago

Download researchweb.iiit.ac.in

General purpose programming on the graphics processing units (GPGPU) has received a lot of attention in the parallel computing community as it promises to offer the highest perfo...

M. Suhail Rehman, Kishore Kothapalli, P. J. Naraya...

IPPS
1999
IEEE

156 views, Distributed And Parallel Com...

Cascaded Execution: Speeding Up Unparallelized Execution on Shared-Memory Multiprocessors

15 years 6 months ago

Download www.cs.virginia.edu

Both inherently sequential code and limitations of analysis techniques prevent full parallelization of many applications by parallelizing compilers. Amdahl's Law tells us tha...

Ruth E. Anderson, Thu D. Nguyen, John Zahorjan
