Sciweavers

657 search results - page 102 / 132
» Analysis of Multithreaded Architectures for Parallel Computi...
IPPS
2008
IEEE
14 years 1 month ago
Lattice Boltzmann simulation optimization on leading multicore platforms
We present an auto-tuning approach to optimize application performance on emerging multicore architectures. The methodology extends the idea of search-based performance optimizatio...
Samuel Williams, Jonathan Carter, Leonid Oliker, J...
HPCA
2008
IEEE
14 years 8 months ago
Roughness of microarchitectural design topologies and its implications for optimization
Recent advances in statistical inference and machine learning close the divide between simulation and classical optimization, thereby enabling more rigorous and robust microarchit...
Benjamin C. Lee, David M. Brooks
HPCA
1997
IEEE
13 years 11 months ago
ATM and Fast Ethernet Network Interfaces for User-Level Communication
Fast Ethernet and ATM are two attractive network technologies for interconnecting workstation clusters for parallel and distributed computing. This paper compares network interfac...
Matt Welsh, Anindya Basu, Thorsten von Eicken
HIPC
2009
Springer
13 years 5 months ago
A performance prediction model for the CUDA GPGPU platform
The significant growth in computational power of modern Graphics Processing Units (GPUs) coupled with the advent of general purpose programming environments like NVIDIA's CUDA,...
Kishore Kothapalli, Rishabh Mukherjee, M. Suhail R...
ESCIENCE
2005
IEEE
14 years 1 month ago
A Platform for Distributed Analysis of Neuroimaging Data on Global Grids
This paper presents a Grid environment developed for analysis of MRI brain data on global Grids. In the current experiment, the MRI data analysis tasks are composed and formulated...
Scott Kolbe, Tianchi Ma, Wei Liu, Wee Siong Soh, R...