Search Sciweavers | Sciweavers

459 search results - page 51 / 92

» Using Kernel Couplings to Predict Parallel Application Perfo...


IEEEPACT
2002
IEEE

149 views, Distributed And Parallel Com...

Optimizing Loop Performance for Clustered VLIW Architectures

15 years 9 months ago

Download www.cs.mtu.edu

Modern embedded systems often require high degrees of instruction-level parallelism (ILP) within strict constraints on power consumption and chip cost. Unfortunately, a high-perfo...

Yi Qian, Steve Carr, Philip H. Sweany



ICML
1997
IEEE

127 views, Machine Learning

Predicting Multiprocessor Memory Access Patterns with Learning Models

16 years 5 months ago

Download clgiles.ist.psu.edu

Machine learning techniques are applicable to computer system optimization. We show that shared memory multiprocessors can successfully utilize machine learning algorithms for mem...

M. F. Sakr, Steven P. Levitan, Donald M. Chiarulli...



HPCA
2009
IEEE

328 views, Distributed And Parallel Com...

Prediction router: Yet another low latency on-chip router architecture

16 years 5 months ago

Download www.am.ics.keio.ac.jp

Network-on-Chips (NoCs) are quite latency sensitive, since their communication latency strongly affects the application performance on recent many-core architectures. To reduce th...

Hiroki Matsutani, Michihiro Koibuchi, Hideharu Ama...



IPPS
2007
IEEE

143 views, Distributed And Parallel Com...

Optimizing Inter-Nest Data Locality Using Loop Splitting and Reordering

15 years 10 months ago

Download www.cecs.uci.edu

With the increasing gap between processor speed and memory latency, the performance of data-dominated programs is becoming more reliant on fast data access, which can be improved...

Sofiane Naci



IPPS
2009
IEEE

900 views, Distributed And Parallel Com...

Singular value decomposition on GPU using CUDA

15 years 11 months ago

Download web2py.iiit.ac.in

Linear algebra algorithms are fundamental to many computing applications. Modern GPUs are suited for many general purpose processing tasks and have emerged as inexpensive high per...

Sheetal Lahabar, P. J. Narayanan

