Spatial Adaptation Models

Next: Practical issues in designing Up: Adaptation Previous: Light Adaptive Architectures

Spatial Adaptation Models

Adaptation the the light level can be achieved by using spatial information and removing the redundant parts of the information. In its simplest and most intuitive form the global spatial average can be used as the common signal among all detectors. Removing the average can be performed by using subtraction or division. The dynamic range will still be limited to a global value. Removing local average instead of the global average can increase the dynamic range by several orders of magnitude. The reason being that each region in the image only adapts to its own average. There has been several models for spatial adaptation which we will review in this section. One may find similarities between the response of the systems based on these models and those from biological retinas. Some of the salient features of the spatial response of biological retina are(See for example [Nabet and Pinter 91]):

Dynamic range reduction: The output signals of the retina have a much smaller dynamic range than its inputs. Therefore, further processing stages do not need to cope with large dynamic range.
Edge enhancement: Edges are very important for every image processing task. By enhancing the edges further processing stages will receive more reliable inputs.
Intensity dependent spatial response: At low light levels, a biological retina acts as a smoothing filter, and at high intensities it performs as a spatial band-pass filter. Also as the light level decreases the span of the spatial processing widens.

These properties have been depicted in Figure 7.42. Now let us review the models.

Figure 7.42: Properties of the retina. a) the dynamic range reduction and edge enhancement features. The numbers are just as indicators of the signal levels. b) Intensity dependent response.

Subtraction from local average
In this method the local average is subtracted from the signal in each cell. Sometimes, two local averages with different spatial distribution are subtracted from each other. This method has two disadvantages. Firstly, the signal will be centered around ``zero'', and the signal variation will depend on the local average. For example, if the average current is about 1nA the signal variation will be around this value. Secondly, this method cannot reproduce the intensity dependent response. Due to its intuitive nature this method has been used in many VLSI implementations of the retina [Mead and Mahowald 88, Bair and Koch 91a, Wu and Chiu 95].
Division by local average
In this method the signal from one cell is divided by the local average. One main advantage of this method over subtraction is that the output is now centered around ``one''. Therefore, the output can now be normalized to a desired value. In subtraction method, if an offset (for example 100nA) is added to move the center from ``zero'', small values may get lost. Another advantage of this method is its multiplicative noise cancellation (MNC) feature. By dividing the signal to the local average, AC noise from artificial light source, which reflect from the surface of objects (and hence have a multiplicative nature), can be reduced to a small fraction. In fact this method was first introduced for this purpose [Moini et al. 95a, Moini et al. 97b]. It has been used as a pre-processor in a motion detection chip to reduce the effect of AC light sources [Moini et al. 97b, Moini et al. 95b].
This method also cannot reproduce the intensity dependent behavior.
Surface reconstruction from noisy data
This model has originally been developed as a way for reducing the noise and recovering the signal from noisy data. Using the regularization theory, one can find the solution to this problem using the biharmonic equation (See section 2.6). The VLSI implementation of this model can produce a normalized output, and demonstrates the dynamic range improvement and edge enhancement features. Although an intensity dependence response has been reported using the implemented circuits, the change of the response is opposite the change in biological retina, i.e. at higher light levels the spatial span of the impulse response becomes wider [Boahen and Andreou 92, Andreou and Boahen 94b].
The theoretical model also does not originally account for intensity dependent characteristics, but it can be modified to include this feature too.
Linear lateral inhibition
Linear lateral inhibition is a simple form of lateral inhibition, where the signal in one cell is subtracted from fractions of the neighboring cells. This model can demonstrate the edge enhancement and dynamic range improvement features. However, it still cannot reproduce the intensity dependent behavior. This method has been used in the implementation of a few shunting inhibition vision chips [Wolpert and Micheli-Tzanakou 93].
Multiplicative lateral inhibition (Shunting inhibition)
In shunting inhibition (SI) a proportion of the output signals of each cell and its neighbors are subtracted from the signal in each channel (See sections 2.19 and 3.27). This model has in fact been developed to model the behavior of the biological retina. It has demonstrated all properties of the biological retina. Several vision chips have been designed based on shunting inhibition [Darling and Dietze 93, Moini et al. 97a].

It is now clear that many VLSI implementations of the retina have deficiencies, in the context of replicating the function of the retina; and only shunting inhibition can be regarded as the closest model for replicating the functionality of the retina.

However, in the context of providing a vision chip which can improve the dynamic range, enhance the edges, and yet be VLSI friendly the ``division by average'' and the model based on biharmonic equation can also be considered as viable options.

Next: Practical issues in designing Up: Adaptation Previous: Light Adaptive Architectures

Alireza Moini,
Centre for High Performance Integrated Technologies and Systems (CHIPTEC),
Adelaide, SA 5005,
March 1997