Performance evaluation of concurrent collections on high-performance multicore computing systems

This paper is the first extensive performance study of a recently proposed parallel programming model called Concurrent Collections (CnC). In CnC, the programmer expresses her computation in terms of application-specific operations, partially ordered by semantic scheduling constraints. The CnC model is well suited to expressing asynchronous-parallel algorithms, so we evaluate CnC using two dense linear algebra algorithms in this style for execution on state-of-the-art multicore systems: (i) a recently proposed asynchronous-parallel Cholesky factorization algorithm, and (ii) a novel and non-trivial "higher-level" partly-asynchronous generalized eigensolver for dense symmetric matrices. Given a well-tuned sequential BLAS, our implementations match or exceed competing multithreaded vendor-tuned codes by up to 2.6
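To make "operations partially ordered by scheduling constraints" concrete, here is a minimal sketch in Python. It is not the CnC API and not the paper's implementation. It assumes the standard tiled right-looking Cholesky dependency pattern, and the task names (`potrf`, `trsm`, `update`) and the helpers `cholesky_task_graph` and `waves` are hypothetical names chosen for illustration. It builds only the dependency graph (no numerical work) and groups tasks into waves that could run concurrently.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def cholesky_task_graph(nt):
    """Map each task to the set of tasks it must wait for (nt x nt tile grid).

    Assumed task kinds (illustrative, not taken from the paper):
      ("potrf", k)        factor diagonal tile k
      ("trsm", i, k)      triangular solve for tile (i, k), i > k
      ("update", i, j, k) update tile (i, j) with column k, k < j <= i
    """
    deps = {}
    for k in range(nt):
        # The diagonal tile must have received its last update first.
        deps[("potrf", k)] = {("update", k, k, k - 1)} if k > 0 else set()
        for i in range(k + 1, nt):
            d = {("potrf", k)}
            if k > 0:
                d.add(("update", i, k, k - 1))
            deps[("trsm", i, k)] = d
        for i in range(k + 1, nt):
            for j in range(k + 1, i + 1):
                # Needs both solved tiles from column k, plus the
                # previous update applied to the same tile.
                d = {("trsm", i, k), ("trsm", j, k)}
                if k > 0:
                    d.add(("update", i, j, k - 1))
                deps[("update", i, j, k)] = d
    return deps


def waves(deps):
    """Group tasks into successive sets whose dependencies are all satisfied."""
    ts = TopologicalSorter(deps)
    ts.prepare()
    out = []
    while ts.is_active():
        ready = sorted(ts.get_ready())
        out.append(ready)
        ts.done(*ready)
    return out


if __name__ == "__main__":
    for step, tasks in enumerate(waves(cholesky_task_graph(3)), start=1):
        print(step, tasks)
```

Only the dependencies constrain execution order here: any tasks in the same wave are independent of one another, which is the kind of freedom an asynchronous runtime could exploit. A real scheduler would not need to wait for a whole wave to finish before starting newly ready tasks.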

Cholesky Factorization Algorithm | Dense Symmetric Matrices | Distributed And Parallel Computing | IPPS 2010 | Semantic Scheduling Constraints

Post Info

Added	13 Feb 2011
Updated	13 Feb 2011
Type	Journal
Year	2010
Where	IPPS
Authors	Aparna Chandramowlishwaran, Kathleen Knobe, Richard W. Vuduc
