Vector Lane Threading


Multi-lane vector processors achieve excellent computational throughput for programs with high data-level parallelism (DLP). However, application phases without significant DLP are unable to fully utilize the datapaths in the vector lanes. In this paper, we propose vector lane threading (VLT), an architectural enhancement that allows idle vector lanes to run short-vector or scalar threads. VLT-enhanced vector hardware can exploit both data-level and thread-level parallelism to achieve higher performance. We investigate implementation alternatives for VLT, focusing mostly on the instruction issue bandwidth requirements. We demonstrate that VLT’s area overhead is small. For applications with short vectors, VLT leads to additional speedup
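The utilization argument in the abstract can be illustrated with a toy model. This is only a sketch under stated assumptions, not the paper's evaluation method: the function name `lane_utilization`, the one-element-per-lane-per-cycle timing, and the even split of lanes among threads are all hypothetical simplifications introduced here.

```python
from math import ceil


def lane_utilization(num_lanes: int, vector_length: int, threads: int = 1) -> float:
    """Fraction of lane-cycles doing useful work in a toy model.

    Assumptions (not taken from the paper): each lane processes one
    element per cycle, the lanes are split evenly among `threads`, and
    every thread issues one vector instruction of length `vector_length`.
    """
    if num_lanes % threads != 0:
        raise ValueError("threads must evenly divide num_lanes")
    lanes_per_thread = num_lanes // threads
    # Cycles needed for one thread to finish its vector instruction.
    cycles = ceil(vector_length / lanes_per_thread)
    # Total elements processed by all threads together.
    useful = threads * vector_length
    return useful / (cycles * num_lanes)


# A single thread with short vectors leaves half of 8 lanes idle.
single = lane_utilization(num_lanes=8, vector_length=4)
# Two short-vector threads, each given 4 lanes, fill all 8 lanes.
threaded = lane_utilization(num_lanes=8, vector_length=4, threads=2)
print(single, threaded)
```

In this model a long vector (for example, length 64 on 8 lanes) already keeps every lane busy with one thread, which matches the abstract's point that the benefit appears in phases with short vectors or scalar code.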

Suzanne Rivoire, Rebecca Schultz, Tomofumi Okuda, Christos Kozyrakis


Distributed And Parallel Computing | ICPP 2006 | Idle Vector Lanes | Multi-lane Vector Processors | Vector Lanes


» The Instruction Execution Mechanism for Responsive Multithreaded Processor

» CAPRI: Prediction of compaction-adequacy for handling control-divergence in GPGPU architectur...

» A Thread and Data-Parallel MPEG-4 Video Encoder for a System-On-Chip Multiprocessor

» On Checking Versus Evaluation of Multiple Queries

» Fine-grain performance scaling of soft vector processors

» Vector processing as a soft-core CPU accelerator

» A Thread Algebra with Multi-level Strategic Interleaving

» Bit vector algorithms enabling high-speed and memory-efficient firewall blacklisting

Post Info

Added	11 Jun 2010
Updated	11 Jun 2010
Type	Conference
Year	2006
Where	ICPP
Authors	Suzanne Rivoire, Rebecca Schultz, Tomofumi Okuda, Christos Kozyrakis
