YodaNN: An Ultra-Low Power Convolutional Neural Network Accelerator Based on Binary Weights

1 July 2016

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

p. 236-241
https://doi.org/10.1109/isvlsi.2016.111

Abstract

Convolutional Neural Networks (CNNs) have revolutionized the world of image classification over the last few years, pushing computer vision close to or beyond human accuracy. The computational effort of today's CNNs requires power-hungry parallel processors and GP-GPUs. Recent efforts in designing CNN Application-Specific Integrated Circuits (ASICs) and accelerators for System-On-Chip (SoC) integration have achieved very promising results. Unfortunately, even these highly optimized engines are still above the power envelope imposed by mobile and deeply embedded applications and face hard limitations caused by CNN weight I/O and storage. On the algorithmic side, highly competitive classification accuracy can be achieved by properly training CNNs with binary weights. This novel algorithmic approach brings major optimization opportunities in the arithmetic core, by removing the need for expensive multiplications, as well as in the weight storage and I/O costs. In this work, we present a HW accelerator optimized for BinaryConnect CNNs that achieves 1510 GOp/s on a core area of only 1.33 MGE and with a power dissipation of 153 mW in UMC 65 nm technology at 1.2 V. Our accelerator outperforms the state of the art in terms of ASIC energy efficiency as well as area efficiency, with 61.2 TOp/s/W and 1135 GOp/s/MGE, respectively.
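
To illustrate why binary weights remove the need for multipliers, the following minimal Python sketch compares an ordinary multiply-accumulate convolution with one whose weights are restricted to +1 and -1 and stored as single bits. It is not the accelerator's datapath: the function names, the bit encoding (1 for +1, 0 for -1), and the tiny test image are assumptions chosen for illustration only.

```python
def conv2d_reference(image, kernel):
    """Valid-mode 2D correlation using ordinary multiply-accumulate."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    out = [[0] * out_w for _ in range(out_h)]
    for y in range(out_h):
        for x in range(out_w):
            acc = 0
            for i in range(kh):
                for j in range(kw):
                    acc += image[y + i][x + j] * kernel[i][j]
            out[y][x] = acc
    return out


def conv2d_binary(image, sign_bits):
    """Same correlation, but each weight is one bit (assumed encoding:
    1 means +1, 0 means -1), so the inner loop only adds or subtracts."""
    kh, kw = len(sign_bits), len(sign_bits[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    out = [[0] * out_w for _ in range(out_h)]
    for y in range(out_h):
        for x in range(out_w):
            acc = 0
            for i in range(kh):
                for j in range(kw):
                    pixel = image[y + i][x + j]
                    # No multiplication: the weight bit selects add or subtract.
                    acc = acc + pixel if sign_bits[i][j] else acc - pixel
            out[y][x] = acc
    return out


# Hypothetical 3x3 input and 2x2 binary kernel, for illustration only.
image = [[1, 2, 0],
         [3, 1, 4],
         [2, 5, 1]]
sign_bits = [[1, 0],
             [1, 1]]
# Expand the bits to +1/-1 weights for the multiply-based reference.
kernel = [[1 if b else -1 for b in row] for row in sign_bits]

binary_result = conv2d_binary(image, sign_bits)
reference_result = conv2d_reference(image, kernel)
print(binary_result == reference_result)
```

Both functions produce the same output, but the binary version needs only one bit per weight and an add/subtract per pixel, which is the source of the arithmetic and weight-storage savings the abstract refers to.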

