There have been several recent efforts to improve the performance of fences. The most aggressive designs allow postfence accesses to retire and complete before the fence completes...
Systems from smartphones to supercomputers are increasingly heterogeneous, being composed of both CPUs and GPUs. To maximize cost and energy efficiency, these systems will increa...
Neha Agarwal, David W. Nellans, Mark Stephenson, M...
Concurrency is pervasive and perplexing, particularly on graphics processing units (GPUs). Current specifications of languages and hardware are inconclusive; thus programmers oft...
Jade Alglave, Mark Batty, Alastair F. Donaldson, G...
The past decade has witnessed the increasing demands on data-driven business intelligence that led to the proliferation of data-intensive applications. A managed object-oriented p...
Khanh Nguyen, Kai Wang, Yingyi Bu, Lu Fang, Jianfe...
Quantum computing (QC) offers huge promise to accelerate a range of computationally intensive benchmarks. Quantum computing is limited, however, by the challenges of decoherence: ...
Jeff Heckey, Shruti Patil, Ali JavadiAbhari, Adam ...
This paper presents PARD, a programmable architecture for resourcing-on-demand that provides a new programming interface to convey an application’s high-level information like q...
This paper presents the design and implementation of PolyMage, a domain-specific language and compiler for image processing pipelines. An image processing pipeline can be viewed ...
Ravi Teja Mullapudi, Vinay Vasista, Uday Bondhugul...
This paper describes a new bottom-up, subset-based, and context-sensitive pointer analysis for Java. The main novelty of our technique is the constraint-based handling of virtual m...
Abstract. We study the efficient implementation of call-by-value using the structure of interactive computation models. This structure has been useful in applications to resource-...