Sciweavers

280 search results - page 18 / 56
» Challenges in exploitation of loop parallelism in embedded a...
ICPP
2009
IEEE
14 years 3 months ago
Speeding Up Distributed MapReduce Applications Using Hardware Accelerators
In an attempt to increase the performance/cost ratio, large compute clusters are becoming heterogeneous at multiple levels: from asymmetric processors, to different system archi...
Yolanda Becerra, Vicenç Beltran, David Carr...
SC
2005
ACM
14 years 2 months ago
Transformations to Parallel Codes for Communication-Computation Overlap
This paper presents program transformations directed toward improving communication-computation overlap in parallel programs that use MPI’s collective operations. Our transforma...
Anthony Danalis, Ki-Yong Kim, Lori L. Pollock, D. ...
ICCD
2001
IEEE
14 years 5 months ago
Cost-effective Hardware Acceleration of Multimedia Applications
General-purpose microprocessors augmented with SIMD execution units enhance multimedia applications by exploiting data level parallelism. However, supporting/overhead related inst...
Deependra Talla, Lizy Kurian John
CODES
2008
IEEE
13 years 10 months ago
Application specific non-volatile primary memory for embedded systems
Memory subsystems are considered one of the most critical components in embedded systems and, furthermore, display increasing complexity as application requirements div...
Kwangyoon Lee, Alex Orailoglu
EUROPAR
2004
Springer
14 years 1 month ago
Exploiting Differentiated Tuple Distribution in Shared Data Spaces
The shared data space model has proven to be an effective paradigm for building distributed applications. However, building an efficient distributed implementation remains a chall...
Giovanni Russello, Michel R. V. Chaudron, Maarten ...