memory bandwidth | Sciweavers

DAC
2012
ACM

216 views, Computer Architecture

A QoS-aware memory controller for dynamically balancing GPU and CPU bandwidth use in an MPSoC

13 years 7 months ago

Diverse IP cores are integrated on a modern system-on-chip and share resources. Off-chip memory bandwidth is often the scarcest resource and requires careful allocation. Two of t...

Min Kyu Jeong, Mattan Erez, Chander Sudanthi, Nige...

ISCA
2012
IEEE

218 views, Hardware

Towards energy-proportional datacenter memory with mobile DRAM

13 years 7 months ago

Download csl.stanford.edu

To increase datacenter energy efficiency, we need memory systems that keep pace with processor efficiency gains. Currently, servers use DDR3 memory, which is designed for high b...

Krishna T. Malladi, Frank A. Nothaft, Karthika Per...

ISCA
2012
IEEE

274 views, Hardware

The dynamic granularity memory system

13 years 7 months ago

Download lph.ece.utexas.edu

Chip multiprocessors enable continued performance scaling with increasingly many cores per chip. As the throughput of computation outpaces available memory bandwidth, however, the...

Doe Hyun Yoon, Min Kyu Jeong, Michael Sullivan, Ma...

SIGCOMM
2012
ACM

211 views, Communications

Multi-resource fair queueing for packet processing

13 years 7 months ago

Download www.cs.berkeley.edu

Middleboxes are ubiquitous in today’s networks and perform a variety of important functions, including IDS, VPN, firewalling, and WAN optimization. These functions differ vastl...

Ali Ghodsi, Vyas Sekar, Matei Zaharia, Ion Stoica

FPGA
2012
ACM

285 views, FPGA

Optimizing SDRAM bandwidth for custom FPGA loop accelerators

14 years 21 days ago

Download cas.ee.ic.ac.uk

Memory bandwidth is critical to achieving high performance in many FPGA applications. The bandwidth of SDRAM memories is, however, highly dependent upon the order in which address...

Samuel Bayliss, George A. Constantinides

CSE
2011
IEEE

192 views, Theoretical Computer Science

Performance Modeling of Hybrid MPI/OpenMP Scientific Applications on Large-scale Multicore Cluster Systems

14 years 4 months ago

Download prophesy.cs.tamu.edu

In this paper, we present a performance modeling framework based on memory bandwidth contention time and a parameterized communication model to predict the performance of OpenMP, M...

Xingfu Wu, Valerie E. Taylor

ASPLOS
2011
ACM

334 views, Programming Languages

MemScale: active low-power modes for main memory

14 years 8 months ago

Download www.cs.rutgers.edu

Main memory is responsible for a large and increasing fraction of the energy consumed by servers. Prior work has focused on exploiting DRAM low-power states to conserve energy. Ho...

Qingyuan Deng, David Meisner, Luiz E. Ramos, Thoma...

ICCAD
2009
IEEE

179 views, Hardware

Automatic memory partitioning and scheduling for throughput and power optimization

15 years 2 months ago

Download cadlab.cs.ucla.edu

Hardware acceleration is crucial in modern embedded system design to meet the explosive demands on performance and cost. Selected computation kernels for acceleration are usually ...

Jason Cong, Wei Jiang, Bin Liu, Yi Zou

SAMOS
2010
Springer

165 views, Embedded Systems

Interleaving granularity on high bandwidth memory architecture for CMPs

15 years 3 months ago

Download www.cs.huji.ac.il

Memory bandwidth has always been a critical factor for the performance of many data-intensive applications. The increasing processor performance, and the advent of single chip m...

Felipe Cabarcas, Alejandro Rico, Yoav Etsion, Alex...

PC
2010

190 views, Management

High-performance cone beam reconstruction using CUDA compatible GPUs

15 years 3 months ago

Download www-hagi.ist.osaka-u.ac.jp

Compute unified device architecture (CUDA) is a software development platform that allows us to run C-like programs on the nVIDIA graphics processing unit (GPU). This paper prese...

Yusuke Okitsu, Fumihiko Ino, Kenichi Hagihara
