Sciweavers

1262 search results - page 194 / 253
» A Container-Iterator Parallel Programming Model
Sort
View
PPOPP
2003
ACM
14 years 3 months ago
User-controllable coherence for high performance shared memory multiprocessors
In programming high performance applications, shared address-space platforms are preferable for fine-grained computation, while distributed address-space platforms are more suita...
Collin McCurdy, Charles N. Fischer
MICRO
1996
IEEE
96views Hardware» more  MICRO 1996»
14 years 2 months ago
Exceeding the Dataflow Limit via Value Prediction
For decades, the serialization constraints imposed by true data dependences have been regarded as an absolute limit--the dataflow limit--on the parallel execution of serial progra...
Mikko H. Lipasti, John Paul Shen
ISPASS
2009
IEEE
14 years 4 months ago
Analyzing CUDA workloads using a detailed GPU simulator
Modern Graphic Processing Units (GPUs) provide sufficiently flexible programming models that understanding their performance can provide insight in designing tomorrow’s manyco...
Ali Bakhoda, George L. Yuan, Wilson W. L. Fung, He...
CCGRID
2010
IEEE
13 years 11 months ago
Region-Based Prefetch Techniques for Software Distributed Shared Memory Systems
Although shared memory programming models show good programmability compared to message passing programming models, their implementation by page-based software distributed shared m...
Jie Cai, Peter E. Strazdins, Alistair P. Rendell
EWC
2011
84views more  EWC 2011»
13 years 5 months ago
A theoretical framework for an intelligent design catalogue
This paper outlines continuing work on the intelligent design catalogue. The intelligent design catalogue seeks to create a virtual design environment that is linked to a catalogu...
Paul Winkelman