YodaNN: An Ultra-Low Power Convolutional Neural Network Accelerator Based on Binary Weights
- 1 July 2016
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 236-241
- https://doi.org/10.1109/isvlsi.2016.111
Abstract
Convolutional Neural Networks (CNNs) have revolutionized the world of image classification over the last few years, pushing the computer vision close beyond human accuracy. The required computational effort of CNNs today requires power-hungry parallel processors and GP-GPUs. Recent efforts in designing CNN Application-Specific Integrated Circuits (ASICs) and accelerators for System-On-Chip (SoC) integration have achieved very promising results. Unfortunately, even these highly optimized engines are still above the power envelope imposed by mobile and deeply embedded applications and face hard limitations caused by CNN weight I/O and storage. On the algorithmic side, highly competitive classification accuracy canbe achieved by properly training CNNs with binary weights. This novel algorithm approach brings major optimization opportunities in the arithmetic core by removing the need for the expensive multiplications as well as in the weight storage and I/O costs. In this work, we present a HW accelerator optimized for BinaryConnect CNNs that achieves 1510 GOp/s on a corearea of only 1.33 MGE and with a power dissipation of 153 mW in UMC 65 nm technology at 1.2 V. Our accelerator outperforms state-of-the-art performance in terms of ASIC energy efficiency as well as area efficiency with 61.2 TOp/s/W and 1135 GOp/s/MGE, respectively.Keywords
Other Versions
This publication has 16 references indexed in Scilit:
- 14.5 Eyeriss: An energy-efficient reconfigurable accelerator for deep convolutional neural networksPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2016
- 14.1 A 126.1mW real-time natural UI/UX processor with embedded deep-learning core for low-power smart glassesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2016
- ShiDianNaoACM SIGARCH Computer Architecture News, 2015
- OrigamiPublished by Association for Computing Machinery (ACM) ,2015
- A Ultra-Low-Energy Convolution Engine for Fast Brain-Inspired Vision in Multicore ClustersPublished by EDAA ,2015
- A 240 G-ops/s Mobile Coprocessor for Deep Neural NetworksPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2014
- DianNaoACM SIGPLAN Notices, 2014
- NeuFlow: Dataflow vision processing system-on-a-chipPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2012
- NeuFlow: A runtime reconfigurable dataflow processor for visionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2011
- Decomposing a scene into geometric and semantically consistent regionsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2009