Search Sciweavers | Sciweavers

656 search results - page 85 / 132

» Scalable Parallel Matrix Multiplication on Distributed Memor...

103

Voted

IPPS
2005
IEEE

115views Distributed And Parallel Com...» more IPPS 2005»

TiNy Threads: A Thread Virtual Machine for the Cyclops64 Cellular Architecture

15 years 7 months ago

Download www.capsl.udel.edu

This paper presents the design and implementation of a thread virtual machine, called TNT (or TiNy-Threads) for the IBM Cyclops64 architecture (the latest Cyclops architecture tha...

Juan del Cuvillo, Weirong Zhu, Ziang Hu, Guang R. ...

claim paper

Read More »

108

Voted

PPOPP
2005
ACM

131views Distributed And Parallel Com...» more PPOPP 2005»

Revocable locks for non-blocking programming

15 years 7 months ago

Download research.microsoft.com

In this paper we present a new form of revocable lock that streamlines the construction of higher level concurrency abstractions such as atomic multi-word heap updates. The key id...

Tim Harris, Keir Fraser

claim paper

Read More »

118

Voted

IWOMP
2009
Springer

149views Programming Languages» more IWOMP 2009»

Scalability Evaluation of Barrier Algorithms for OpenMP

15 years 8 months ago

Download www2.cs.uh.edu

OpenMP relies heavily on barrier synchronization to coordinate the work of threads that are performing the computations in a parallel region. A good implementation of barriers is ...

Ramachandra C. Nanjegowda, Oscar Hernandez, Barbar...

claim paper

Read More »

118

Voted

ARC
2008
Springer

115views Hardware» more ARC 2008»

A High Throughput FPGA-based Floating Point Conjugate Gradient Implementation

15 years 4 months ago

Download cas.ee.ic.ac.uk

As Field Programmable Gate Arrays (FPGAs) have reached capacities beyond millions of equivalent gates, it becomes possible to accelerate floating-point scientific computing applica...

Antonio Roldao Lopes, George A. Constantinides

claim paper

Read More »

123

Voted

ICPP
2002
IEEE

150views Distributed And Parallel Com...» more ICPP 2002»

Analysis of Memory Hierarchy Performance of Block Data Layout

15 years 7 months ago

Download halcyon.usc.edu

Recently, several experimental studies have been conducted on block data layout as a data transformation technique used in conjunction with tiling to improve cache performance. In...

Neungsoo Park, Bo Hong, Viktor K. Prasanna

claim paper

Read More »

« Prev « First page 85 / 132 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers