Sciweavers

MICRO
1997
IEEE
90views Hardware» more  MICRO 1997»
14 years 4 months ago
ProfileMe: Hardware Support for Instruction-Level Profiling on Out-of-Order Processors
Profile data is valuable for identifying performance bottlenecks and guiding optimizations. Periodic sampling of a processor's performance monitoring hardware is an effective...
Jeffrey Dean, James E. Hicks, Carl A. Waldspurger,...
MICRO
1997
IEEE
108views Hardware» more  MICRO 1997»
14 years 4 months ago
Improving the Accuracy and Performance of Memory Communication Through Renaming
As processors continue to exploit more instruction level parallelism, a greater demand is placed on reducing the e ects of memory access latency. In this paper, we introduce a nov...
Gary S. Tyson, Todd M. Austin
MICRO
1997
IEEE
110views Hardware» more  MICRO 1997»
14 years 4 months ago
The Design and Performance of a Conflict-Avoiding Cache
High performance architectures depend heavily on efficient multi-level memory hierarchies to minimize the cost of accessing data. This dependence will increase with the expected i...
Nigel P. Topham, Antonio González, Jos&eacu...
MICRO
1997
IEEE
92views Hardware» more  MICRO 1997»
14 years 4 months ago
The Predictability of Data Values
Yiannakis Sazeides, James E. Smith
MICRO
1997
IEEE
93views Hardware» more  MICRO 1997»
14 years 4 months ago
Trace Processors
Eric Rotenberg, Quinn Jacobson, Yiannakis Sazeides...
MICRO
1997
IEEE
79views Hardware» more  MICRO 1997»
14 years 4 months ago
On High-Bandwidth Data Cache Design for Multi-Issue Processors
Highly aggressive multi-issue processor designs of the past few years and projections for the next decade require that we redesign the operation of the cache memory system. The nu...
Jude A. Rivers, Gary S. Tyson, Edward S. Davidson,...
MICRO
1997
IEEE
127views Hardware» more  MICRO 1997»
14 years 4 months ago
Exploiting Dead Value Information
We describe Dead Value Information (DVI) and introduce three new optimizations which exploit it. DVI provides assertions that certain register values are dead, meaning they will n...
Milo M. K. Martin, Amir Roth, Charles N. Fischer
MICRO
1997
IEEE
116views Hardware» more  MICRO 1997»
14 years 4 months ago
Tuning Compiler Optimizations for Simultaneous Multithreading
Compiler optimizations are often driven by specific assumptions about the underlying architecture and implementation of the target machine. For example, when targeting shared-mem...
Jack L. Lo, Susan J. Eggers, Henry M. Levy, Sujay ...
MICRO
1997
IEEE
89views Hardware» more  MICRO 1997»
14 years 4 months ago
Available Parallelism in Video Applications
Heng Liao, Andrew Wolfe
MICRO
1997
IEEE
87views Hardware» more  MICRO 1997»
14 years 4 months ago
Improving Code Density Using Compression Techniques
We propose a method for compressing programs in embedded processors where instruction memory size dominates cost. A post-compilation analyzer examines a program and replaces commo...
Charles Lefurgy, Peter L. Bird, I-Cheng K. Chen, T...