[Neural Networks 45 \(2013\) 4–26](http://dx.doi.org/10.1016/j.neunet.2013.05.011)


Andrew S. Cassidy <sup>[a](#page-0-0), 1</sup>, Julius Georgiou <sup>[b](#page-0-2)</sup>, Andreas G. Andreou <sup>a[,b,](#page-0-2)</sup>\*

<span id="page-0-2"></span><span id="page-0-0"></span><sup>a</sup> *Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD 21218, USA*

<sup>b</sup> *Department of Electrical and Computer Engineering, University of Cyprus, Nicosia 1678, Cyprus*

## Article info

*Keywords:* Silicon brains; Neuromorphic engineering; Silicon neurons; Learning in silicon; FPGA neural arrays

### Abstract

We present a design framework for neuromorphic architectures in the nano-CMOS era. Our approach to the design of spiking neurons and STDP learning circuits relies on parallel computational structures where neurons are abstracted as digital arithmetic logic units and communication processors. Using this approach, we have developed arrays of silicon neurons that scale to millions of neurons in a single state-of-the-art Field Programmable Gate Array (FPGA). We demonstrate the validity of the design methodology through the implementation of cortical development in a circuit of spiking neurons, STDP synapses, and neural architecture optimization.

© 2013 Elsevier Ltd. All rights reserved.

# **1. The computer and the brain**

The brain is a massively parallel and efficient information processing system, with a radically different computational architecture from present-day computers. Characteristics of neural computation include event-based processing, fine-grained parallel computational units, robustness and redundancy, as well as adaptation and learning, all done under severe constraints of size, weight, and energy resources. This computational architecture excels at lower-level sensory information processing such as vision and sensor–motor integration, as well as at cognitive tasks such as speech and language understanding.

Over the last half century, computer scientists, architects and engineers have envisioned building computers that match the parallel processing capabilities of biological brains. Fifty years ago, the fathers of computer science, Alan Turing [\(Turing,](#page--1-0) [1952\)](#page--1-0) and John von Neumann [\(Neumann,](#page--1-1) [1958\)](#page--1-1), looked to the brain for inspiration in order to advance the science of computing.

Twenty-five years ago, the connectionist movement emerged as an alternative approach to artificial intelligence for solving the hard problems in perception and cognition. The central doctrine of the connectionist movement is that the cognitive abilities of the brain result from a highly interconnected network of simple processing units. These simple non-linear computational units abstract the function of neurons, while synapses abstract the connections between neurons. The strength of the synaptic connections in networks of such units is determined through a learning algorithm. A two-volume edited book set by the "Parallel Distributed Processing (PDP) Research Group" [\(McClelland,](#page--1-2) [Rumelhardt,](#page--1-2) [&](#page--1-2) [Group,](#page--1-2) [1987;](#page--1-2) [Rumelhart,](#page--1-3) [McClelland,](#page--1-3) [&](#page--1-3) [Group,](#page--1-3) [1987\)](#page--1-3) defined the research agenda in the field of connectionist architectures and neural networks in the decades that followed. At about the same time, Carver Mead's book "Analog VLSI and Neural Systems" [\(Mead,](#page--1-4) [1989\)](#page--1-4) inspired a new generation of scientists and engineers to explore hardware implementation of neural models in state-of-the-art silicon integrated circuit technology. The book had a dual objective: (i) to create a new design discipline for collective computational systems using analog VLSI subthreshold CMOS integrated circuit technology and (ii) to promote a synthetic approach in the understanding of biology and the human brain. This was the birth of neuromorphic design as an engineering discipline.

<span id="page-0-1"></span>1 Now with IBM Research, Almaden, USA.

## *1.1. Neuromorphic engineering: the formative years*

"Neuromorphic" electronic systems, a term coined by Carver Mead in the late 1980s, are systems that perform artificial computation based on the principles of neurobiological circuits. In the following two decades, inspired by the pioneering work of Mead [\(Mead,](#page--1-5) [1990\)](#page--1-5) and his colleagues at Caltech, a large number of CMOS neuromorphic chip designs were reported in the literature.

These spanned a wide range of designs, from analog VLSI models of neurons [\(Arthur](#page--1-6) [&](#page--1-6) [Boahen,](#page--1-6) [2010;](#page--1-6) [Hsin,](#page--1-7) [Saighi,](#page--1-7) [Buhry,](#page--1-7) [&](#page--1-7) [Renaud,](#page--1-7) [2010;](#page--1-7) [Saighi,](#page--1-8) [Bornat,](#page--1-8) [Tomas,](#page--1-8) [Le](#page--1-8) [Masson,](#page--1-8) [&](#page--1-8) [Renaud,](#page--1-8) [2010;](#page--1-8) [Yu,](#page--1-9) [Sejnowski,](#page--1-9) [&](#page--1-9) [Cauwenberghs,](#page--1-9) [2011\)](#page--1-9) to silicon retina architectures [\(Boahen](#page--1-10) [&](#page--1-10) [Andreou,](#page--1-10) [1992;](#page--1-10) [Mahowald,](#page--1-11) [1992\)](#page--1-11), retinomorphic vision systems [\(Boahen,](#page--1-12) [1996\)](#page--1-12), attention circuits [\(Horiuchi](#page--1-13) [&](#page--1-13) [Koch,](#page--1-13) [1999\)](#page--1-13), and biomorphic imagers [\(Culurciello,](#page--1-14) [Etienne-Cummings,](#page--1-14) [&](#page--1-14) [Boahen,](#page--1-14) [2003\)](#page--1-14) that abstract biology at a lower level. Other mixed-mode designs [\(Andreou,](#page--1-15) [Meitzler,](#page--1-15) [Strohben,](#page--1-15) [&](#page--1-15) [Boahen,](#page--1-15) [1995;](#page--1-15) [Pardo,](#page--1-16) [Dierickx,](#page--1-16) [&](#page--1-16) [Scheffer,](#page--1-16) [1998;](#page--1-16) [Etienne-Cummings,](#page--1-17) [Kalayjian,](#page--1-17) [&](#page--1-17) [Donghui,](#page--1-17) [2001\)](#page--1-17) have also implemented silicon retinas and focal plane processing architectures that include processing beyond gain control and spatio-temporal filtering, including polarization sensing [\(Andreou](#page--1-18) [&](#page--1-18) [Kalayjian,](#page--1-18) [2002;](#page--1-18) [Wolff](#page--1-19) [&](#page--1-19) [Andreou,](#page--1-19) [1995\)](#page--1-19). Most of the above bio-inspired sensors have limited programmability as they employ analog computational circuits at the focal plane.

<span id="page-0-3"></span><sup>∗</sup> Corresponding author at: Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD 21218, USA.

*E-mail addresses:* [andrewca@us.ibm.com](mailto:andrewca@us.ibm.com) (A.S. Cassidy), [julio@ucy.ac.cy](mailto:julio@ucy.ac.cy) (J. Georgiou), [andreou@jhu.edu](mailto:andreou@jhu.edu) (A.G. Andreou).

<sup>0893-6080/\$ –</sup> see front matter © 2013 Elsevier Ltd. All rights reserved. <http://dx.doi.org/10.1016/j.neunet.2013.05.011>

The shortcomings of non-programmable analog architectures motivated the exploration of analog vision chip architectures with programmable functionality [\(Serrano-Gotarredona,](#page--1-20) [Andreou,](#page--1-20) [&](#page--1-20) [Linares-Barranco,](#page--1-20) [1999;](#page--1-20) [Serrano-Gotarredona](#page--1-21) [et al.,](#page--1-21) [2009\)](#page--1-21). Programmable architectures for associative memory [\(Boahen,](#page--1-22) [Pouliquen,](#page--1-22) [Andreou,](#page--1-22) [&](#page--1-22) [Jenkins,](#page--1-22) [1989;](#page--1-22) [Pouliquen,](#page--1-23) [Andreou,](#page--1-23) [&](#page--1-23) [Strohben,](#page--1-23) [1997\)](#page--1-23), pattern classification [\(Genov](#page--1-24) [&](#page--1-24) [Cauwenberghs,](#page--1-24) [2001;](#page--1-24) [Karakiewicz,](#page--1-25) [Genov,](#page--1-25) [&](#page--1-25) [Cauwenberghs,](#page--1-25) [2007\)](#page--1-25) and audition [\(Kumar,](#page--1-26) [Himmelbauer,](#page--1-26) [Cauwenberghs,](#page--1-26) [&](#page--1-26) [Andreou,](#page--1-26) [1998;](#page--1-26) [Stanacevic](#page--1-27) [&](#page--1-27) [Cauwenberghs,](#page--1-27) [2005\)](#page--1-27) have also been reported in the literature.

Programmable architectures have also been advanced by the adoption of a standard interface between chips known as Address Event Representation (AER). The time-multiplexed AER bus [\(Boahen,](#page--1-28) [2000;](#page--1-28) [Lin](#page--1-29) [&](#page--1-29) [Boahen,](#page--1-29) [2009;](#page--1-29) [Mahowald,](#page--1-11) [1992;](#page--1-11) [Sivilotti,](#page--1-30) [1991\)](#page--1-30) is a popular interconnect method for neuromorphic systems. Spike events from multiple channels are time-multiplexed onto a digital AER bus, transmitted, and decoded at the destination onto individual channels. Throughout this paper, we use the terms spikes, events, and spike events interchangeably. AER has been used by many analog and digital spiking neural arrays, to communicate events from off-chip neuromorphic sensors, and even in 3D CMOS technology [\(Harrison,](#page--1-31) [Özgün,](#page--1-31) [Lin,](#page--1-31) [Andreou,](#page--1-31) [&](#page--1-31) [Etienne-Cummings,](#page--1-31) [2010\)](#page--1-31). The European Union project CAVIAR [\(http://www2.imse-cnm.csic.es/caviar/\)](http://www2.imse-cnm.csic.es/caviar/) demonstrated a board-level vision system architecture communicating using the AER protocol [\(Serrano-Gotarredona](#page--1-21) [et al.,](#page--1-21) [2009\)](#page--1-21). Variants of AER that improve the efficiency of the protocol have also been proposed [\(Georgiou](#page--1-32) [&](#page--1-32) [Andreou,](#page--1-32) [2006,](#page--1-32) [2007\)](#page--1-33). A probabilistic approach to AER has been exploited to perform computations in the address domain [\(Goldberg,](#page--1-34) [Cauwenberghs,](#page--1-34) [&](#page--1-34) [Andreou,](#page--1-34) [2001b\)](#page--1-34).
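The multiplexing and decoding steps described above can be illustrated with a minimal software sketch. It is not taken from any of the cited designs: the names `AddressEvent`, `encode_aer` and `decode_aer` are hypothetical, and the explicit timestamp is an assumption made for simulation purposes (on a hardware AER bus, timing is usually implied by when the address appears on the bus).

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List


@dataclass(frozen=True)
class AddressEvent:
    """One spike on the shared bus (hypothetical software model)."""
    time: float   # spike time, arbitrary units (assumed explicit here)
    address: int  # index of the channel (neuron) that fired


def encode_aer(channels: Dict[int, Iterable[float]]) -> List[AddressEvent]:
    """Time-multiplex the spikes of many channels onto one ordered stream."""
    events = [AddressEvent(t, addr)
              for addr, times in channels.items()
              for t in times]
    # Ties in time are broken by address, a simple stand-in for bus arbitration.
    return sorted(events, key=lambda ev: (ev.time, ev.address))


def decode_aer(bus: Iterable[AddressEvent]) -> Dict[int, List[float]]:
    """Route each event on the bus back to its own channel."""
    channels: Dict[int, List[float]] = {}
    for ev in bus:
        channels.setdefault(ev.address, []).append(ev.time)
    return channels


spikes = {0: [0.1, 0.5], 3: [0.2], 7: [0.05, 0.5]}
bus = encode_aer(spikes)
recovered = decode_aer(bus)
```

In this sketch only the address (and, by assumption, a timestamp) travels over the shared link, so a single bus serves every channel and the receiver recovers the per-channel spike trains from the addresses alone.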

Learning in silicon has also been pursued intensively in the analog VLSI neuromorphic community. The early work by Diorio and colleagues [\(Diorio,](#page--1-35) [Hasler,](#page--1-35) [Minch,](#page--1-35) [&](#page--1-35) [Mead,](#page--1-35) [1996,](#page--1-35) [1997\)](#page--1-36), the Field Programmable Analog Arrays [\(Sivilotti,](#page--1-30) [1991\)](#page--1-30) and the research program of Hasler [\(Hall,](#page--1-37) [Twigg,](#page--1-37) [Gray,](#page--1-37) [Hasler,](#page--1-37) [&](#page--1-37) [Anderson,](#page--1-37) [2005\)](#page--1-37) paved the way to floating gate MOS transistors in configurable learning chips. Other designs employ dynamic circuits for implementing learning in analog VLSI with excellent results on small systems [\(Bartolozzi](#page--1-38) [&](#page--1-38) [Indiveri,](#page--1-38) [2007;](#page--1-38) [Indiveri,](#page--1-39) [Chicca,](#page--1-39) [&](#page--1-39) [Douglas,](#page--1-39) [2004;](#page--1-39) [Mahowald,](#page--1-11) [1992\)](#page--1-11). This work has continued with encouraging results for hardware models that abstract higher-level functions such as stimulus specific adaptation [\(Mill,](#page--1-40) [Sheik,](#page--1-40) [Indiveri,](#page--1-40) [&](#page--1-40) [Denham,](#page--1-40) [2011\)](#page--1-40) and working memory using attractor dynamics [\(Giulioni](#page--1-41) [et al.,](#page--1-41) [2011\)](#page--1-41).

Abstracting biology at a higher level, the Cellular Non-linear/ Neural Networks (CNN) approach [\(Chua](#page--1-42) [&](#page--1-42) [Yang,](#page--1-42) [1988\)](#page--1-42) offered another paradigm for an analog visual processor with programming capabilities. In CNN architectures, information processing is implemented through the evolution of a continuous-time non-linear dynamical network with nearest neighborhood connectivity. The CNN–UM (Universal Machine) is one of the earliest systems [\(Roska](#page--1-43) [&](#page--1-43) [Chua,](#page--1-43) [1993\)](#page--1-43) that implemented CNN programmable functionality on a chip. Another example of CNN hardware implementation merges a CNN–UM type processor and an imager [\(Carmona](#page--1-44) [et al.,](#page--1-44) [1998;](#page--1-44) [Dominguez-Castro](#page--1-45) [et al.,](#page--1-45) [1997\)](#page--1-45). This system, while analog internally, has a digital interface with on-chip 7-bit A/D and D/A converters, improving the programmability and simplifying the interface to digital computers [\(Cembrano](#page--1-46) [et al.,](#page--1-46) [2004\)](#page--1-46).
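As a rough illustration of what "evolution of a continuous-time non-linear dynamical network with nearest neighborhood connectivity" means, the sketch below integrates a small cellular network with forward-Euler steps. It follows the general form of the Chua and Yang cell equation under simplifying assumptions that are ours, not the source's: normalized cell capacitance and resistance (both 1), 3×3 feedback and input templates, a zero-valued boundary, and hypothetical template values chosen only so that the example settles quickly.

```python
from typing import List

Grid = List[List[float]]


def output(x: float) -> float:
    """Piecewise-linear cell output y = 0.5 * (|x + 1| - |x - 1|)."""
    return 0.5 * (abs(x + 1.0) - abs(x - 1.0))


def cnn_step(x: Grid, u: Grid, A: Grid, B: Grid, bias: float, dt: float) -> Grid:
    """One forward-Euler step of dx/dt = -x + A*y + B*u + bias."""
    rows, cols = len(x), len(x[0])
    new_x = [row[:] for row in x]
    for i in range(rows):
        for j in range(cols):
            feedback = 0.0     # contribution of neighbouring outputs
            feedforward = 0.0  # contribution of neighbouring inputs
            for di in (-1, 0, 1):
                for dj in (-1, 0, 1):
                    k, m = i + di, j + dj
                    if 0 <= k < rows and 0 <= m < cols:  # zero boundary (assumed)
                        feedback += A[di + 1][dj + 1] * output(x[k][m])
                        feedforward += B[di + 1][dj + 1] * u[k][m]
            new_x[i][j] = x[i][j] + dt * (-x[i][j] + feedback + feedforward + bias)
    return new_x


def run_cnn(u: Grid, A: Grid, B: Grid, bias: float = 0.0,
            dt: float = 0.05, steps: int = 400) -> Grid:
    """Start from a zero state, integrate, and return the cell outputs."""
    x = [[0.0] * len(u[0]) for _ in u]
    for _ in range(steps):
        x = cnn_step(x, u, A, B, bias, dt)
    return [[output(v) for v in row] for row in x]


# Hypothetical templates: self-feedback of 2 and a direct input weight of 1.
A = [[0.0, 0.0, 0.0], [0.0, 2.0, 0.0], [0.0, 0.0, 0.0]]
B = [[0.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 0.0]]
u = [[1.0, -1.0], [-1.0, 1.0]]
y = run_cnn(u, A, B)
```

With these templates each cell latches to the sign of its own input; programming a network of this kind amounts to choosing different `A`, `B` and `bias` values, which is the sense in which the approach is programmable.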

Programmable analog VLSI circuits and systems aimed at large-scale model simulation have also been under development in the last decade. The Neurogrid architecture in Boahen's group [\(Arthur](#page--1-6) [&](#page--1-6) [Boahen,](#page--1-6) [2010;](#page--1-6) [Choudhary](#page--1-47) [et al.,](#page--1-47) [2012;](#page--1-47) [Silver,](#page--1-48) [Boahen,](#page--1-48) [Grillner,](#page--1-48) [Kopell,](#page--1-48) [&](#page--1-48) [Olsen,](#page--1-48) [2007\)](#page--1-48), the IFAT architecture [\(Goldberg,](#page--1-49) [Cauwenberghs,](#page--1-49) [&](#page--1-49) [Andreou,](#page--1-49) [2001a;](#page--1-49) [Vogelstein,](#page--1-50) [Mallik,](#page--1-50) [Culurciello,](#page--1-50) [Cauwenberghs,](#page--1-50) [&](#page--1-50) [Etienne-Cummings,](#page--1-50) [2007\)](#page--1-50), the PAX platform [\(Renaud](#page--1-51) [et al.,](#page--1-51) [2010\)](#page--1-51) and the FACETS wafer-scale computational infrastructure [\(Bruederle](#page--1-52) [et al.,](#page--1-52) [2011\)](#page--1-52) are notable projects in this direction.

## *1.2. Neuromorphic engineering: the nano-CMOS era*

In 1986, Mead's group at Caltech was employing bulk CMOS technology with $\lambda$ between 2.5 micron and 0.7 micron (p. 59 of [Mead,](#page--1-4) [1989\)](#page--1-4). A quick review of our own publications and laboratory notebooks from that period reveals that we were fabricating chips in 4 micron Silicon On Sapphire (SOS)–CMOS technology and in 3 micron *p*-well bulk CMOS. Alas! Twenty-five years later, with foundry CMOS technologies at the 45 nm and 22 nm nodes, the neuromorphic engineering community at large has not been able to capitalize on the ($\times 10\,000$) improvement in digital MOS transistor area density to engineer brain-like structures and cognitive machines that match the effectiveness and energetic efficiency of the human brain. With the exception of the event-based, asynchronous vision sensors [\(Lichtsteiner,](#page--1-53) [Posch,](#page--1-53) [&](#page--1-53) [Delbruck,](#page--1-53) [2008\)](#page--1-53) and a subsequent design [\(Posch,](#page--1-54) [Matolin,](#page--1-54) [&](#page--1-54) [Wohlgenannt,](#page--1-54) [2011\)](#page--1-54), the goals of endowing modern computer systems with industrial-strength, robust bio-inspired sensoria and of tackling the challenge of silicon cognition remain unrealized. Our lack of knowledge about the inner workings of brain function and behavior has contributed to this chasm and continues to limit us today. Matching the information processing capabilities of biological neural structures in state-of-the-art silicon technology has remained an elusive dream despite the stunning advances in microelectronics.
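The order of magnitude of the quoted density improvement can be checked with a back-of-the-envelope calculation. The sketch below assumes ideal scaling, in which transistor area shrinks with the square of the minimum feature size, and treats the quoted process figures as feature sizes; the function name `area_density_gain` and the 30 nm midpoint are illustrative choices, not values from the source.

```python
def area_density_gain(old_feature_nm: float, new_feature_nm: float) -> float:
    """Ideal-scaling estimate: devices per unit area grow as (old / new) ** 2."""
    return (old_feature_nm / new_feature_nm) ** 2


# 3 micron p-well bulk CMOS versus an assumed 30 nm feature size,
# chosen between the 45 nm and 22 nm nodes mentioned in the text.
gain_mid = area_density_gain(3000.0, 30.0)

# Bracketing cases built from the figures quoted in the text.
gain_low = area_density_gain(4000.0, 45.0)   # 4 micron SOS-CMOS to 45 nm
gain_high = area_density_gain(2500.0, 22.0)  # 2.5 micron to 22 nm
```

Under these assumptions a hundredfold linear shrink gives a gain of about ten thousand, and the bracketing cases fall within the same order of magnitude.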

Even more elusive has been our quest to understand how to achieve the energy efficiency seen in biological brains. One would have thought that the research activities of the last two decades would have brought us closer to both a deeper understanding of brain function and commercially viable brain-inspired information technology at scale. However, this is not the case. Many analog VLSI neuromorphic systems rely on analog devices, and the density of these components (mostly MOS transistors and capacitors) did not scale according to Moore's law. Furthermore, the majority of neuromorphic hardware was based on traditional "analog" circuit models of neurons and synapses, a technology that offers flexibility neither in component models nor in their level of description, an aspect which impedes rapid advances.

Mead advocated using analog transistor physics to perform neural computation, directly mimicking the currents in neuron ion channels [\(Mead,](#page--1-5) [1990\)](#page--1-5), and speculated that an energy savings of approximately $10^4$ could be gained over comparable traditional digital approaches. However, the power dissipation of neuromorphic systems did not benefit from technology scaling either, and our best circuits today hover between 10 and 100 nW per computational cell. Each cell typically has one or two single-pole circuits with two or three current branches biased at the nanoampere current level. Even though one could argue the power dissipation is manageable locally, the energy cost to send the