# A 32x32 pixels vision sensor for Selective Change Driven readout strategy

Pedro Zuccarello and Fernando Pardo Department of Informatics Universitat de Valencia Valencia, Spain {pedro.zuccarello, fernando.pardo}@uv.es Alejandro de la Plaza Universidad de Buenos Aires Buenos Aires, Argentina aplaza@fi.uba.ar José Antonio Boluda Department of Informatics Universitat de Valencia Valencia, Spain jose.a.boluda@uv.es

Abstract— The first VLSI vision circuit design following selective-change-driven (SCD) readout strategy is presented and simulated. Under this strategy, only pixels that exhibit large variations between the present and last transmitted illumination values are deemed to carry relevant information that is worth processing. Every pixel in a SCD VLSI implementation must be able to store and compare the present and the last transmitted illumination values. A subset of pixels exhibiting the largest variation between these two, should be read out in descending order of importance. This novel design accomplishes all these goals. Simulations on a 32x32 pixel matrix with sampling period of 500  $\mu$ s and readout interval of 10  $\mu$ s are presented. Average power consumption per pixel is in the order of 16  $\mu$ W.

#### I. INTRODUCTION

Frame-based imagers work under some very well known principles: the illumination level of the surrounding environment is sampled and transmitted at regular time intervals, even if no new information is produced. As such, a huge amount of redundant information is nevertheless processed and transmitted. Perfected over millions of years of evolution, biological vision systems work under a completely different basis. One of the key features of these systems, is that each sensor cell (pixel) reacts independently and asynchronously to illumination changes. Starting with Mahowalds's PhD work in 1992 [1], the last two decades have witnessed significant efforts towards overcoming the many limitations of framebased vision sensors and algorithms - namely by incorporating biological neuromorphic principles into VLSI hardware design. Work like [1] and [2] pioneered in this field, together with, [3], [4] and [5] show successful implementations of neuromorphic vision systems. Results in this field clearly show that bandwidth and processing power needs are substantially reduced when neuromorphic principles are applied.

Within this framework, a new readout strategy is proposed in [6]. Such a selective change-driven (SCD) system is based on the idea that a pixel showing a large change in intensity is an indicator of fast movements and object edges around it. Usually, fast movements and edges (high spatial/temporal frequencies) are the most interesting parts of a scene. In the proposed strategy, successive snapshots of the environment are taken at a fast pace, but pixel readout is performed in an entirely different manner. Pixels are now transmitted out of the chip in descending order per their illumination level difference with respect to the last transmitted value. When a pixel's address and illumination information are transmitted, the illumination value is stored in a memory within the chip so the next forthcoming value can be compared against it. Pixel readout frequency can now be adapted to bandwidth availability. In the worst case scenario, one single pixel is transmitted, this being the one with the largest illumination change in the entire scene.

While the validity of SCD readout has been extensively discussed in [6], the benefits of its application to computer vision algorithms have been clearly demonstrated by MATLAB simulations in [7]. The purpose of this paper is to advance one step further by showing the first VLSI circuit design with SPECTRE simulations towards a silicon implementation.

In the transistor level implementation, the difference between the present and the previous illumination levels is physically represented as the output current of a wide-input-range operational transconductance amplifier (OTA) [8]. According to the SCD readout strategy, a subcircuit is needed to select the maximum amongst all these currents. Such a goal can be accomplished by a winner-takes-all (WTA) circuit [9] [10] [11] [12]. If we picture a WTA circuit as a black box, we'll see that a set of currents is taken as input, and a set of voltages is produced as output. Every output voltage is associated to a single input current. Ideally, the output voltage associated to the highest input current should be well differentiated from the rest. According to the current polarities in our light sensing circuit, the winning pixel should output a voltage close to OV while the losers' output should be close to Vdd.

Given the always-limited discrimination ability of the WTA circuit, a large number of pixels increase the probability of having a certain number of cells competing with equal credentials. Therefore, multiple winners can be observed at the output nodes of the WTA amplifiers. To cope with this scenario, [13] proposes a digital logic based on the propagation of vertical and horizontal inhibition signals. Following a predefined path across the pixel matrix, inhibition signals are propagated so when the first winning output is found, the following outputs in the propagation path are forced to be close to Vdd. In favor of saving silicon area at the pixel level, only horizontal inhibition

signals are used in this particular work; but the general idea of the inhibiting mechanism remains the same.

In Section II a description of the pixel circuit is introduced. Afterwards, Section III, shows the simulation of a 32x32 pixel matrix implementation in SPECTRE. Finally, we summarize our conclusions and depict a frame of reference for future work on the subject.

# II. PIXEL CIRCUIT

## A. Light sensing and OTA subcircuits

As Fig. 1 shows, the light sensing subcircuit is based on a linear photocircuit [14]. The three main components of the photocircuit are the photodiode (whose very well known function is to transduct an incident light intensity into a photocurrent) and two capacitors:  $C_{present}$  and  $C_{previous}$ . The function of  $C_{present}$  is to store the present illumination value as a voltage once the parasitic capacitance of the photodiode is discharged by the photocurrent.  $C_{previous}$  stores the pixel's last read out value. Signal WTAfeedback is fed back from the output of the digital logic of the WTA subcircuit; in this way, only the pixel showing the largest illumination variation is read out at Vreadout node, and the voltage at its  $C_{previous}$ capacitor is updated. Transistor MPROUT and node Vreadout are common to the whole pixel matrix.

A wide-input-range OTA [8] is used to sense the voltage difference between  $C_{present}$  and  $C_{previous}$ . OTA's output current can be positive or negative depending on wehther there has been an increase or decrease in the illumination level. According to the SCD strategy we are only interested in the absolute value of this variation, therefore this current has to be fully rectified [15]. Given this, the rectified output current of the OTA subcircuit for a particular pixel is a direct measure of the difference between the last sensed and the last read out illumination values.

## B. WTA subcircuit

As explained previously, according to the SCD strategy the largest of the rectified currents should be selected. We accomplish this goal using a mixed analogue/digital WTA circuit implementation, depicted in Fig. 1.

All analogue WTA amplifiers are connected to the common node labelled WTAcommon, along with Itail current source. During normal operation, in case of a single winner cell, transistor MP4 conveys the total ITail current while remaining in the saturation region. For the defeated cells, the current throughout the MP4 transistor is negligible (<< ITail) and transistor MP1 is forced to work in the linear region with its Vds close to zero. As a result the WTAout node is close to Vdd in the loser cells and close to 0 V in the winning cells.

Transistors MP1, MP2 and MP3, and current source IPol, form a gain-boosted regulated-cascode configuration whose only objective is to increase the resolution of the analogue WTA by increasing the impedance at WTAout node. An indepth description of the analogue subcircuit is provided in [11]. The subcircuit labelled as WTA digital logic, generates a horizontal inhibition signal that follows the propagation path depicted in Fig. 2. Assuming that a logic "1" is a voltage close to Vdd and a logic "0" is close to 0 V, all InhibitForward signals in the path are sequentially asserted to "1" until the first winning cell is found. The first winning cell pulls Verdict signal to "0", and propagates an InhibitForward signal of "0". The logic gates that follow in the propagation path have inputs InhibitPrevious=0 and output Verdict set to "1". Only one single cell, the first winner found in the propagation path, has its Verdict signal in a low state.

Transistors MPinv, MNinv and MNstarved form a starved inverter that acts as a single bit analogue/digital interface presenting high impedance to the analogue output.

Care must be taken so the charge at  $C_{previous}$  capacitor is updated for the winning pixel only. Transmission gates TXG1 and TXG2, and inverters INVA and INVB, form a D latch whose objective is to feed a stable WTAfeedback signal into the photocircuit.

#### **III. SIMULATIONS AND RESULTS**

A matrix of 32 x 32 pixels is simulated with the SPECTRE circuit simulator using models provided by austriamicrosystems 0.35  $\mu$ m CMOS technology. The periods of CKphd and CKwta clock signals are set to 500  $\mu$ s and 10  $\mu$ s respectively. Fifty (50) pixels are read out per light sampling period, which, according to the SCD concept, are the most interesting ones. Two thousand (2000) light samples per second are taken. The average power consumption per pixel is in the order of 16  $\mu$ W. Vdd=3.3 V.

In previous work with SPECTRE simulations [13] we show how the implemented WTA analogue/digital subcircuit efficiently handles a multiple winner situation by selecting a single winner with a resolution in the order of 10 nA. The results provided herein are focused on the manner in which winning pixels are read out and on how the voltage on there  $C_{previous}$  capacitor is correctly updated.

The whole process is shown in Fig. 3 for pixel (29,22). This pixel was chosen for no special reason other than for being one of the multiple winners at some point in the simulation. If rows and columns are numbered in the range  $0 \dots 31$ , and the inhibiting propagation path is the one depicted in Fig. 2, then cell (29,22) inhibits cell (29,21), and cell (29,23) inhibits cell (29,22). This last pixel has its  $WTAout_{(29,22)}$  signal at a low state, which means that it belongs to the multiple winners' subset. When the simulation reaches 1.01 ms, signal InhibitPrevious(29,22) = InhibitForward(29,23) is pulled high. Since WTAout(29,22) is in a low state, Verdict(29,22) falls to low leaving pixel (29,22) as the only winner in the matrix. In the next pulse of CKwta, at 1,02 ms, WTAfeedback(29,22) is pulled low. Voltage at  $C_{present(29,22)}$  is copied to  $C_{previous(29,22)}$  and voltage at Vreadout is transmitted out of the chip. Differential voltage at  $OTA_{(29,22)}$ 's subcircuit input, now registering 0 V, produce a negligible  $Irect_{(29,22)}$  output current.



Fig. 1. Schematic of the pixel circuit.

The updated cell now loses the contest at WTA subcircuit and InhibitForward<sub>(29,22)</sub> is pulled high permiting cell (29,21), the next multiple winner in the inhibiting path, to become a single winner in the matrix.

# IV. CONCLUSION

We have described a novel circuit for vision VLSI chips, this being the first VLSI design that follows the basis of SCD strategy. Pixels that exhibit large variations between the present illumination value and the last transmitted one are the only ones considered relevant and are therefore read out in descending order of importance. Since focus is being put on those pixels that show relevant (new) information, the circuit can be considered as following the principles of neuromorphic vision system designs.



Fig. 2. Propagation path for inhibition signals.

The light sampling interval is 500  $\mu$ s. Simulations show how, every 10  $\mu$ s, the circuit pinpoints the pixel with the largest change in illumination amongst the entire pixel matrix. Hence, only the most relevant information is selected and then transmitted out of the chip for processing purposes. As a result, an equivalent velocity of 2000 frames per second is achieved, facilitating the processing of high-speed movements without the need of large bandwith or high computing power.

#### ACKNOWLEDGEMENT

This work has been supported by grant TEC2009-1298 from the Ministry of Science and Technology of Spain.

#### REFERENCES

- M. A. Mahowald, "VLSI analogs of neural visual processing: a synthesis of form and function," Ph.D. dissertation, California Institute of Technology, Pasadena, California, May 1992.
- [2] K. Boahen, "The retinomorphic approach: Pixel-parallel adaptive amplification, filtering, and quatization," *Analog Integrated Circuits and Signal Processing*, vol. 13, no. 1-2, pp. 53–68, May 1997.
- [3] P. Lichtsteiner, C. Posch, and T. Delbrück, "A 128x128 dB 15 µs latency asynchronous temporal contrast vision sensor," *IEEE Journal of Solid-State Circuits*, vol. 43, no. 2, pp. 566–576, Feb. 2008.
- [4] P.-F. Rüedi, P. Heim, F. Kaess, E. Grenet, F. Heitger, P.-Y. Burgi, S. Gyger, and P. Nussbaum, "A 128x128 pixel 120-dB dynamic-range vision-sensor chip for image contrast and orientation extraction," *IEEE Journal of Solid-State Circuits*, vol. 38, no. 12, pp. 2325–2333, Dec. 2003.
- [5] Y. M. Chi, U. Mallik, M. A. Clapp, E. Choi, G. Cauwenberghs, and R. Etienne-Cummings, "CMOS camera with in-pixel temporal change detection and ADC," *IEEE Journal of Solid-State Circuits*, vol. 42, no. 10, pp. 2187–2196, October 2007.
- [6] F. Pardo, J. A. Boluda, F. Vegara, and P. Zuccarello, "On the advantages of asynchronous pixel reading and processing for high-speed motion estimation," in *Advances in Visual Computing*, ser. Lectures Notes in Computer Science, vol. 5358/2008. Springer Berlin / Heidelberg, Dec. 2008, pp. 205–215, 4th International Symposium, ISVC 2008, Las Vegas, NV, USA, December 1-3, 2008. Proceedings, Part I.
- [7] J. A. Boluda, F. Vegara, F. Pardo, and P. Zuccarello, "Selective changedriven image processing: a speeding-up strategy," in *Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications*, ser. Lectures Notes in Computer Science, vol. 5856/2009. Springer Berlin / Heidelberg, November 2009, pp. 37–44, 14th Iberoamerican Conference on Pattern Recognition, CIARP 2009, Guadalajara, Jalisco, Mexico, November 15-18, 2009.
- [8] A. Nedungadi and T. Viswanathan, "Design of linear CMOS transconductance elements," *IEEE Transactions on Circuits and Systems*, vol. CAS-31, no. 10, pp. 891–894, October 1984.
- [9] J. Lazzaro, S. Rayckebusch, M. A. Mahowald, and C. A. Mead, Winnertake-all networks of O(n) complexity. San Mateo, CA: Morgan Kaufmann, 1989, vol. 1, pp. 703–711.

- [10] J. Starzyk and X. Fang, "CMOS current mode winner-take-all circuit with both excitatory and inhibitory feedback," *Electronics Letters*, vol. 29, no. 10, pp. 908–910, May 1993.
- [11] B. Sekerkiran and U. Cilingiroglu, "Precision improvement in currentmode winner-take-all circuits using gain-boosted regulated-cascode CMOS stages," in *The IEEE International Joint Conference on Neural Networks Proceedings*, vol. 1, May 1998, pp. 553–556.
- [12] G. Indiveri, "A current-mode hysteretic winner-take-all network, with excitatory and inhibitory coupling," *Analog Integrated Circuits and Signal Processing*, vol. 28, no. 3, pp. 279–291, 09/21 2001, 10.1023/A:1011208127849. [Online]. Available: http://dx.doi.org/10.1023/A:1011208127849
- [13] P. Zuccarello, F. Pardo, A. de la Plaza, and J. A. Boluda, "32x32 winnertake-all matrix with single winner selection," *Electronics Letters*, vol. 46, no. 5, pp. 333–335, March 2010.
- [14] A. Moini, Vision Chips. Kluwer Academic Publishers, 2000.
- [15] Z. Wang, "Novel pseudo RMS current converter for sinusoidal signals using a CMOS precision current rectifier," *IEEE Transactions on Instrumentation and Measurement*, vol. 39, no. 4, pp. 670–671, Aug. 1990.



Fig. 3. (a)InhibitForward<sub>(29,23)</sub> = InhibitPrevious<sub>(29,22)</sub>, (b) WTAout<sub>(29,22)</sub> (solid) and WTAout<sub>(29,21)</sub> (dashed), (c) Verdict<sub>(29,22)</sub> (solid) and Verdict<sub>(29,21)</sub> (dashed), (d) WTAfeedback<sub>(29,22)</sub>, (e) voltage at  $C_{previous(29,22)}$  (solid) and  $C_{present(29,22)}$  (dashed), (f) Irect<sub>(29,22)</sub>, and (g) InhibitForward<sub>(29,22)</sub>.