AC-DIMM
- 23 June 2013
- journal article
- research article
- Published by Association for Computing Machinery (ACM) in ACM SIGARCH Computer Architecture News
- Vol. 41 (3), 189-200
- https://doi.org/10.1145/2508148.2485939
Abstract
With technology scaling, on-chip power dissipation and off-chip memory bandwidth have become significant performance bottlenecks in virtually all computer systems, from mobile devices to supercomputers. An effective way of improving performance in the face of bandwidth and power limitations is to rely on associative memory systems. Recent work on a PCM-based, associative TCAM accelerator shows that associative search capability can reduce both off-chip bandwidth demand and overall system energy. Unfortunately, previously proposed resistive TCAM accelerators have limited flexibility: only a restricted (albeit important) class of applications can benefit from a TCAM accelerator, and the implementation is confined to resistive memory technologies with a high dynamic range (R_High/R_Low), such as PCM. This work proposes AC-DIMM, a flexible, high-performance associative compute engine built on a DDR3-compatible memory module. AC-DIMM addresses the limited flexibility of previous resistive TCAM accelerators by combining two powerful capabilities: associative search and processing in memory. Generality is improved by augmenting a TCAM system with a set of integrated, user-programmable microcontrollers that operate directly on search results, and by architecting the system such that key-value pairs can be co-located in the same TCAM row. A new, bit-serial TCAM array is proposed, which enables the system to be implemented using STT-MRAM. AC-DIMM achieves a 4.2X speedup and a 6.5X energy reduction over a conventional RAM-based system on a set of 13 evaluated applications.
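The two ideas the abstract combines, ternary associative search over rows that hold a key and its value together, followed by a programmable step that runs on the matches, can be illustrated with a small software model. This is only a sketch under stated assumptions: the names `Row`, `search`, and `search_and_reduce` are hypothetical, the 4-bit keys are made up for illustration, and nothing here reflects the actual AC-DIMM hardware, its bit-serial array, or its microcontroller instruction set.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Row:
    """One hypothetical TCAM row holding a ternary key and its value."""
    key: int    # stored key bits
    care: int   # 1 = compare this bit, 0 = "don't care" (ternary X)
    value: int  # data co-located with the key in the same row


def search(rows: List[Row], query: int) -> List[Row]:
    """Return every row whose cared-about key bits equal the query bits."""
    return [r for r in rows if ((r.key ^ query) & r.care) == 0]


def search_and_reduce(rows: List[Row], query: int,
                      reduce_fn: Callable[[List[int]], int]) -> int:
    """Model a programmable step that operates on the matched values."""
    return reduce_fn([r.value for r in search(rows, query)])


# Illustrative contents (assumed, not taken from the paper).
rows = [
    Row(key=0b1010, care=0b1111, value=7),  # matches exactly 1010
    Row(key=0b1000, care=0b1100, value=5),  # matches 10XX
    Row(key=0b0110, care=0b1111, value=9),  # matches exactly 0110
]

# Sum the values of all rows that match the query 1010.
total = search_and_reduce(rows, 0b1010, sum)
print(total)
```

In this model the search loop is sequential, whereas an associative memory compares all rows at once; the point of the sketch is only that matching and the follow-up computation both happen next to the data, so just the reduced result would need to cross the memory bus.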
Funding Information
- International Business Machines Corporation
- Division of Computing and Communication Foundations (CCF-1054179)
- New York State Office of Science and Technology
- Qualcomm
- Cisco Systems
- Samsung
This publication has 29 references indexed in Scilit:
- A resistive TCAM accelerator for data-intensive computing. Published by Association for Computing Machinery (ACM), 2011
- Standby-Power-Free Compact Ternary Content-Addressable Memory Cell Chip Using Magnetic Tunnel Junction Devices. Applied Physics Express, 2009
- Algorithms for advanced packet classification with ternary CAMs. Published by Association for Computing Machinery (ACM), 2005
- Giant tunnelling magnetoresistance at room temperature with MgO (100) tunnel barriers. Nature Materials, 2004
- A permutation-based page interleaving scheme to reduce row-buffer conflicts and exploit data locality. Published by Association for Computing Machinery (ACM), 2000
- SPEC CPU2000: measuring CPU performance in the New Millennium. Computer, 2000
- Memory access scheduling. Published by Association for Computing Machinery (ACM), 2000
- On using the CAM concept for parametric curve extraction. IEEE Transactions on Image Processing, 2000
- Processing in memory: the Terasys massively parallel PIM array. Computer, 1995
- A content-addressable memory architecture for image coding using vector quantization. IEEE Transactions on Signal Processing, 1991