Search Sciweavers | Sciweavers

311 search results - page 5 / 63

» Software-Controlled Multithreading Using Informing Memory Op...

188

click to vote

ISCA
2008
IEEE

148views Hardware» more ISCA 2008»

Atomic Vector Operations on Chip Multiprocessors

16 years 1 months ago

Download userweb.cs.utexas.edu

The current trend is for processors to deliver dramatic improvements in parallel performance while only modestly improving serial performance. Parallel performance is harvested th...

Sanjeev Kumar, Daehyun Kim, Mikhail Smelyanskiy, Y...

claim paper

Read More »

178

click to vote

ISCA
2010
IEEE

185views Hardware» more ISCA 2010»

Dynamic warp subdivision for integrated branch and memory divergence tolerance

15 years 12 months ago

Download www.cs.virginia.edu

SIMD organizations amortize the area and power of fetch, decode, and issue logic across multiple processing units in order to maximize throughput for a given area and power budget...

Jiayuan Meng, David Tarjan, Kevin Skadron

claim paper

Read More »

164

click to vote

IPPS
2010
IEEE

134views Distributed And Parallel Com...» more IPPS 2010»

Optimization of linked list prefix computations on multithreaded GPUs using CUDA

15 years 4 months ago

Download www.umiacs.umd.edu

We present a number of optimization techniques to compute prefix sums on linked lists and implement them on multithreaded GPUs using CUDA. Prefix computations on linked structures ...

Zheng Wei, Joseph JáJá

claim paper

Read More »

192

click to vote

IWMM
2009
Springer

114views Hardware» more IWMM 2009»

Scalable support for multithreaded applications on dynamic binary instrumentation systems

16 years 1 months ago

Download www.cs.virginia.edu

Dynamic binary instrumentation systems are used to inject or modify arbitrary instructions in existing binary applications; several such systems have been developed over the past ...

Kim M. Hazelwood, Greg Lueck, Robert Cohn

claim paper

Read More »

190

click to vote

IISWC
2008
IEEE

243views Operating System» more IISWC 2008»

Accelerating multi-core processor design space evaluation using automatic multi-threaded workload synthesis

16 years 1 months ago

Download www.iiswc.org

The design and evaluation of microprocessor architectures is a difficult and time-consuming task. Although small, handcoded microbenchmarks can be used to accelerate performance e...

Clay Hughes, Tao Li

claim paper

Read More »

« Prev « First page 5 / 63 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers