Cache Friendliness-Aware Management of Shared Last-Level Caches for High-Performance Multi-Core Systems

Abstract
To achieve high efficiency and prevent destructive interference among multiple divergent workloads, the last-level cache of a Chip Multiprocessor (CMP) has to be carefully managed. Previously proposed cache management schemes suffer from inefficient utilization of cache capacity: they either focus on reducing the absolute number of cache misses or allocate capacity without taking the applications' memory sharing characteristics into consideration. Reducing the overall number of misses does not always translate into higher performance, as Memory-Level Parallelism (MLP) can hide the latency penalty of a significant number of misses in out-of-order execution. In this work we describe a quasi-partitioning scheme for last-level caches that combines the memory-level parallelism, cache friendliness, and interference sensitivity of competing applications to efficiently manage the shared cache capacity. The proposed scheme improves both system throughput and execution fairness, outperforming previous schemes that are oblivious to applications' memory behavior. Our detailed full-system simulations showed an average improvement of 10 percent in throughput and 9 percent in fairness over the next best scheme for a four-core CMP system.
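The MLP argument above can be illustrated with a small numeric sketch. This is a hypothetical model, not the paper's algorithm: it assumes a fixed miss latency and that misses issued in the same cluster overlap completely, so a cluster's stall cost is roughly one memory latency regardless of its size.

```python
# Hypothetical illustration of why raw miss counts can mislead:
# with memory-level parallelism (MLP), overlapping misses share
# latency, so a cluster of concurrent misses stalls the core for
# roughly one memory access, not the sum of all accesses.

MISS_LATENCY = 200  # assumed DRAM access latency in cycles


def effective_stall_cycles(miss_clusters):
    """Estimate stall cycles for a list of miss clusters.

    miss_clusters: list of ints, each the number of concurrent
    (fully overlapped) misses issued together.
    Returns (serialized, overlapped) cycle estimates.
    """
    # Serialized cost: every miss pays the full latency.
    serialized = sum(miss_clusters) * MISS_LATENCY
    # Overlapped cost: each cluster stalls for ~one latency.
    overlapped = len(miss_clusters) * MISS_LATENCY
    return serialized, overlapped


# App A: 8 isolated misses; App B: 8 misses in two clusters of 4.
a_serial, a_overlap = effective_stall_cycles([1] * 8)
b_serial, b_overlap = effective_stall_cycles([4, 4])
# Both apps miss 8 times, but B's MLP hides most of the penalty,
# so evicting B's data costs far less than the miss count suggests.
```

Under this toy model both applications incur 1600 serialized cycles, yet the high-MLP application stalls for only about 400 cycles, which is why an MLP-aware scheme weighs misses by their exposed latency rather than their count.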
