The gradient-based cache partitioning algorithm

Abstract
This paper addresses the problem of partitioning a cache among multiple concurrent threads in the presence of hardware prefetching. Cache replacement policies designed to preserve temporal locality (e.g., LRU) allocate cache resources in proportion to the miss rate of each competing thread, irrespective of whether the cache space will actually be utilized [Qureshi and Patt 2006]. This is clearly suboptimal, as applications vary dramatically in their use of recently accessed data. We address this problem by partitioning a shared cache so that a global goodness metric is optimized. This paper introduces the Gradient-based Cache Partitioning Algorithm (GPA), whose variants optimize either hit rate, total instructions per cycle (IPC), or a weighted IPC metric designed to enforce Quality of Service (QoS) [Iyer 2004]. In the context of QoS, GPA obtains the maximum throughput from low-priority threads while ensuring high performance for high-priority threads. The GPA mechanism is robust, low-cost, and easy to integrate with existing cache designs, and it improves the throughput of an in-order 8-core system sharing an 8MB L3 cache by ∼14%.
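To make the gradient-based idea concrete, the sketch below shows one hypothetical way a partition controller could periodically shift cache capacity toward the thread with the largest marginal benefit. It is a minimal illustration only, not the paper's hardware mechanism: the way-granular `partition` dictionary, the `gradient` estimates (e.g., per-way hit-rate improvement measured over the last interval), and the `rebalance` helper are all assumptions introduced for this example.

```python
# Hypothetical sketch of gradient-style cache partitioning (illustrative only;
# not the mechanism implemented in the paper). Each thread is assumed to report
# an estimate of the marginal benefit of one additional cache way, and the
# controller moves a single way per interval toward the steepest gradient.

def rebalance(partition, gradient, total_ways):
    """Move one way from the thread with the smallest marginal benefit
    to the thread with the largest, keeping the total allocation fixed."""
    donor = min(partition, key=lambda t: gradient[t])
    recipient = max(partition, key=lambda t: gradient[t])
    if donor != recipient and partition[donor] > 1:
        partition[donor] -= 1
        partition[recipient] += 1
    assert sum(partition.values()) == total_ways
    return partition

# Example: 4 threads sharing a 16-way cache; gradients are hypothetical
# per-way hit-rate improvements observed during the last measurement interval.
partition = {"t0": 4, "t1": 4, "t2": 4, "t3": 4}
gradient = {"t0": 0.01, "t1": 0.20, "t2": 0.05, "t3": 0.002}
print(rebalance(partition, gradient, total_ways=16))
```

Repeating this small adjustment every measurement interval approximates gradient ascent on the chosen global goodness metric (hit rate, total IPC, or weighted IPC), which is the intuition behind GPA's variants.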
