Sciweavers

ASAP
2007
IEEE

SIMD Vectorization of Histogram Functions

14 years 5 months ago
SIMD Vectorization of Histogram Functions
Existing SIMD extensions cannot efficiently vectorize the histogram function due to memory collisions. We propose two techniques to avoid this problem. In the first, a hierarchical structure of three levels is proposed. In order to provide n-way parallelism, auxiliary arrays that have n and n/2 subarrays are used in the first and second level, respectively. The last level has the primary histogram array. Indirect SIMD load and store instructions are designed in order to access different elements of different subarrays. The different subarrays in the lower levels are merged and finally at the end, the calculated results are stored in the primary histogram array. In the second method, parallel comparators are used in order to count the number of subwords within a media register that are the same. Thereafter, these numbers are added to the values of the histogram array simultaneously. Experimental results obtained by extending the SimpleScalar toolset show that proposed techniques im...
Asadollah Shahbahrami, Ben H. H. Juurlink, Stamati
Added 02 Jun 2010
Updated 02 Jun 2010
Type Conference
Year 2007
Where ASAP
Authors Asadollah Shahbahrami, Ben H. H. Juurlink, Stamatis Vassiliadis
Comments (0)