Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

50

TC
1998

favoriteEmaildiscussreport

139views Information Technology» more TC 1998»

Performance Evaluation and Cost Analysis of Cache Protocol Extensions for Shared-Memory Multiprocessors

14 years 4 months ago

Performance Evaluation and Cost Analysis of Cache Protocol Extensions for Shared-Memory Multiprocessors

Download www.cs.odu.edu

—We evaluate three extensions to directory-based cache coherence protocols in shared-memory multiprocessors. These extensions are aimed at reducing the penalties associated with memory accesses and include a hardware prefetching scheme, a migratory sharing optimization, and a competitive-update mechanism. Since each extension targets distinct components of the read and write penalties, they can be combined effectively. This paper identifies the combinations yielding the best performance gains and cost trade-offs in the context of a class of cache-coherent NUMA (Non-Uniform Memory Access) architectures. Detailed architectural simulations of a multiprocessor with single-issue, statically scheduled CPUs, using five benchmarks, show that the protocol extensions often provide additive gains when they are properly combined. For example, the combination of prefetching with the competitive-update mechanism speeds up the execution by nearly a factor of two under release consistency. The same ...

Fredrik Dahlgren, Michel Dubois, Per Stenströ

Real-time Traffic

Competitive-update Mechanism | Directory-based Cache Coherence | Information Technology | Migratory Sharing Optimization | TC 1998 |

claim paper

Related Content

» A performance evaluation of cache injection in busbased shared memory multiprocessors

» A Novel Lightweight Directory Architecture for Scalable SharedMemory Multiprocessors

» The Performance of CacheCoherent Ringbased Multiprocessors

» SoftwareExtended Coherent Shared Memory Performance and Cost

» Willow A Scalable Shared Memory Multiprocessor

» A Performance Evaluation of the Convex SPP1000 Scalable Shared Memory Parallel Computer

» Simplifying Active Memory Clusters by Leveraging Directory Protocol Threads

» A PowerAware PredictionBased Cache Coherence Protocol for Chip Multiprocessors

» Towards optimizing energy costs of algorithms for shared memory architectures

Post Info
More Details (n/a)

Added	23 Dec 2010
Updated	23 Dec 2010
Type	Journal
Year	1998
Where	TC
Authors	Fredrik Dahlgren, Michel Dubois, Per Stenström

Comments (0)