In programming high performance applications, shared address-space platforms are preferable for fine-grained computation, while distributed address-space platforms are more suita...
For decades, the serialization constraints imposed by true data dependences have been regarded as an absolute limit--the dataflow limit--on the parallel execution of serial progra...
Modern Graphic Processing Units (GPUs) provide sufficiently flexible programming models that understanding their performance can provide insight in designing tomorrow’s manyco...
Ali Bakhoda, George L. Yuan, Wilson W. L. Fung, He...
Although shared memory programming models show good programmability compared to message passing programming models, their implementation by page-based software distributed shared m...
This paper outlines continuing work on the intelligent design catalogue. The intelligent design catalogue seeks to create a virtual design environment that is linked to a catalogu...