Search Sciweavers | Sciweavers

PPOPP
2010
ACM

Scaling LAPACK panel operations using parallel cache assignment

16 years 14 days ago

In LAPACK many matrix operations are cast as block algorithms which iteratively process a panel using an unblocked algorithm and then update a remainder matrix using the high perf...

Anthony M. Castaldo, R. Clint Whaley

PPOPP
2010
ACM

Scheduling support for transactional memory contention management

16 years 14 days ago

Download www.cs.bgu.ac.il

Transactional Memory (TM) is considered one of the most promising paradigms for developing concurrent applications. TM has been shown to scale well on multiple cores when the d...

Walther Maldonado, Patrick Marlier, Pascal Felber,...

PPOPP
2010
ACM

An adaptive performance modeling tool for GPU architectures

15 years 10 months ago

Download impact.crhc.illinois.edu

This paper presents an analytical model to predict the performance of general-purpose applications on a GPU architecture. The model is designed to provide performance information ...

Sara S. Baghsorkhi, Matthieu Delahaye, Sanjay J. P...

PPOPP
2010
ACM

Thread to strand binding of parallel network applications in massive multi-threaded systems

15 years 9 months ago

Download capinfo.e.ac.upc.edu

In processors with several levels of hardware resource sharing, like CMPs in which each core is an SMT, the scheduling process becomes more complex than in processors with a singl...

Petar Radojkovic, Vladimir Cakarevic, Javier Verd&...

PPOPP
2010
ACM

Helper locks for fork-join parallel programming

15 years 5 months ago

Download people.csail.mit.edu

Helper locks allow programs with large parallel critical sections, called parallel regions, to execute more efficiently by enlisting processors that might otherwise be waiting on ...

Kunal Agrawal, Charles E. Leiserson, Jim Sukha
