Efficient vectorization of SIMD programs with non-aligned and irregular data access hardware


Automatic vectorization of programs for partitioned-ALU SIMD (Single Instruction Multiple Data) processors has been difficult not only because of data dependency issues but also because of non-aligned and irregular data access. A non-aligned or irregular data access incurs many overhead cycles for data alignment, which complicates efficient code generation and hinders automatic vectorization. In this paper, we employ two special memory access hardware units to improve the performance of SIMD processors: the split line buffer and the packing buffer. The former solves the non-aligned memory access problem, while the latter simplifies irregular and strided data access. Adding these hardware units requires only very small changes to the instruction set architecture and yields a significant performance improvement by vectorizing more loops and reducing the overhead cycles. We have also developed an auto-vectorization compiler...
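
To make the overhead described in the abstract concrete, the following is a minimal C sketch of what software typically has to do when no such hardware is available. It is an illustration only, not the authors' hardware or compiler: the names `load_unaligned` and `gather_strided`, the 8-byte vector width, and the 4-lane layout are all assumptions chosen for the example. A non-aligned load is emulated with two aligned loads and a merge, and a strided load is emulated with one scalar access per lane.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define VEC_BYTES 8 /* assumed vector width in bytes (hypothetical) */
#define LANES 4     /* assumed number of 16-bit lanes (hypothetical) */

/* Software emulation of a non-aligned vector load: two aligned loads
 * followed by a byte-wise merge. mem is treated as starting on a vector
 * boundary, and mem_len must be a multiple of VEC_BYTES. */
static void load_unaligned(const uint8_t *mem, size_t mem_len, size_t addr,
                           uint8_t out[VEC_BYTES])
{
    assert(mem_len % VEC_BYTES == 0);
    assert(addr + VEC_BYTES <= mem_len);

    size_t shift = addr % VEC_BYTES; /* offset inside the aligned line */
    size_t base = addr - shift;      /* aligned line containing addr */
    uint8_t lo[VEC_BYTES];
    uint8_t hi[VEC_BYTES] = {0};

    memcpy(lo, mem + base, VEC_BYTES);                 /* first aligned load */
    if (shift != 0)
        memcpy(hi, mem + base + VEC_BYTES, VEC_BYTES); /* second aligned load */

    /* Merge: take the tail of the first line and the head of the second. */
    for (size_t i = 0; i < VEC_BYTES; i++) {
        size_t j = i + shift;
        out[i] = (j < VEC_BYTES) ? lo[j] : hi[j - VEC_BYTES];
    }
}

/* Software emulation of a strided load: one scalar access per lane,
 * packed into consecutive lanes of the result. */
static void gather_strided(const int16_t *src, size_t stride,
                           int16_t out[LANES])
{
    for (size_t i = 0; i < LANES; i++)
        out[i] = src[i * stride];
}
```

In this sketch every non-aligned vector costs an extra load plus a merge, and every strided vector costs one access per lane. These are the kinds of overhead cycles that, per the abstract, the split line buffer and the packing buffer are meant to reduce.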

Hoseok Chang, Wonyong Sung

Automatic Vectorization | CASES 2008 | Irregular Data Access | Split Line Buffer | System Software |

» Speculative Dynamic Vectorization

» Designing irregular parallel algorithms with mutual exclusion and lockfree protocols

» FAST fast architecture sensitive tree search on modern CPUs and GPUs

» CUDA compatible GPU cards as efficient hardware accelerators for SmithWaterman sequence al...

Post Info

Added	12 Oct 2010
Updated	12 Oct 2010
Type	Conference
Year	2008
Where	CASES
Authors	Hoseok Chang, Wonyong Sung
