Cache Friendliness-Aware Management of Shared Last-Level Caches for High Performance Multi-Core Systems
- 28 January 2013
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Computers
- Vol. 63 (4), 874-887
- https://doi.org/10.1109/tc.2013.18
Abstract
To achieve high efficiency and prevent destructive interference among multiple divergent workloads, the last-level cache of chip multiprocessors (CMPs) has to be carefully managed. Previously proposed cache management schemes use cache capacity inefficiently, either because they focus on reducing the absolute number of cache misses or because they allocate cache capacity without considering the applications' memory sharing characteristics. Reducing the overall number of misses does not always yield higher performance, because memory-level parallelism (MLP) can hide the latency penalty of a significant number of misses in out-of-order execution. In this work we describe a quasi-partitioning scheme for last-level caches that combines the memory-level parallelism, cache friendliness, and interference sensitivity of competing applications to efficiently manage the shared cache capacity. The proposed scheme improves both system throughput and execution fairness, outperforming previous schemes that are oblivious to applications' memory behavior. Our detailed, full-system simulations showed an average improvement of 10 percent in throughput and 9 percent in fairness over the next best scheme for a four-core CMP system.
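The abstract's point that miss counts alone can mislead may be easier to see with a toy calculation. The sketch below is not the paper's scheme; it is a minimal illustration under a simplifying assumption: a core stalls for every cycle in which at least one miss is outstanding, and overlapping misses are serviced in parallel. The function name `stall_cycles`, the fixed latency of 200 cycles, and the issue times are all hypothetical values chosen for illustration.

```python
def stall_cycles(miss_issue_cycles, latency):
    """Return the cycles during which at least one miss is outstanding.

    Assumption (not from the paper): every miss takes the same fixed
    `latency`, and overlapping misses are serviced fully in parallel,
    so the stall time is the length of the union of the miss intervals.
    """
    total = 0
    covered_until = None  # end of the latest interval counted so far
    for start in sorted(miss_issue_cycles):
        end = start + latency
        if covered_until is None or start >= covered_until:
            # No overlap with earlier misses: pay the full latency.
            total += latency
            covered_until = end
        elif end > covered_until:
            # Partial overlap: pay only the part not already covered.
            total += end - covered_until
            covered_until = end
    return total


LATENCY = 200  # hypothetical memory latency in cycles

# Two hypothetical applications with the same number of misses.
clustered = [0, 10, 20, 30]        # misses overlap (high MLP)
isolated = [0, 1000, 2000, 3000]   # misses never overlap (low MLP)

clustered_stall = stall_cycles(clustered, LATENCY)
isolated_stall = stall_cycles(isolated, LATENCY)

print(len(clustered), clustered_stall)
print(len(isolated), isolated_stall)
```

Under these assumptions both applications incur four misses, yet the application with isolated misses stalls far longer, which is why a scheme that only minimizes the total miss count may give capacity to the wrong application.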