Sciweavers

2784 search results - page 104 / 557
» Instruction Level Parallelism
EUROPAR
1997
Springer
15 years 6 months ago
The Performance Potential of Value and Dependence Prediction
Abstract. The serialization constraints induced by the detection and enforcement of true data dependences have always been regarded as requirements for correct execution. We propos...
Mikko H. Lipasti, John Paul Shen
VLSID
2003
IEEE
16 years 3 months ago
Design of a 2D DCT/IDCT application specific VLIW processor supporting scaled and sub-sampled blocks
We present an innovative design of an accurate 2D DCT/IDCT processor, which handles scaled and sub-sampled input blocks efficiently. In the IDCT mode, the latency of the processo...
Rohini Krishnan, Om Prakash Gangwal, Jos T. J. van...
ARC
2009
Springer
15 years 9 months ago
A Parallel Branching Program Machine for Emulation of Sequential Circuits
The parallel branching program machine (PBM128) consists of 128 branching program machines (BMs) and a programmable interconnection. To represent logic functions on BMs, we use qua...
Hiroki Nakahara, Tsutomu Sasao, Munehiro Matsuura,...
CF
2006
ACM
15 years 6 months ago
Landing OpenMP on Cyclops-64: an efficient mapping of OpenMP to a many-core system-on-a-chip
This paper presents our experience mapping the OpenMP parallel programming model to the IBM Cyclops-64 (C64) architecture. The C64 employs a many-core-on-a-chip design that integrates...
Juan del Cuvillo, Weirong Zhu, Guang R. Gao
CCGRID
2008
IEEE
15 years 9 months ago
Application-Level Fault-Tolerance Solutions for Grid Computing
Daniel Díaz, Xoán C. Pardo, Marí...