Axiomatic Theory of Receptive Fields

Version	Summary	Created by	Modification	Content Size	Created at	Operation
1	handwiki	Vivi Li	--	1274	2022-11-07 01:49:39

The content is sourced from: https://handwiki.org/wiki/Biology:Axiomatic_theory_of_receptive_fields

Receptive field profiles registered by cell recordings have shown that mammalian vision has developed receptive fields tuned to different sizes and orientations in the image domain as well as to different image velocities in space-time. Corresponding cell recordings in the auditory system has shown that mammals have developed receptive fields tuned to different frequencies as well as temporal transients. This article describes normative theories that have been developed to explain these properties of sensory receptive fields based on structural properties of the environment. Beyond theoretical explanation of biological phenomena, these theories can also be used for computational modelling of biological receptive fields and for building algorithms for artificial perception based on sensory data.

computational modelling receptive fields structural properties

1. Computational Theory of Visual Receptive Fields

Idealized models of visual receptive fields similar to those found in the retina, the lateral geniculate nucleus and the primary visual cortex of higher mammals can be derived in an axiomatic way from structural requirements on the first stages of visual processing that reflect symmetry properties of the surrounding world in combination with additional assumptions to ensure internally consistent image representations at multiple spatial and temporal scales.^[1]^[2] Specifically, idealized functional models for linear spatio-temporal receptive fields can be derived in a principled manner to constitute a combination of Gaussian derivatives over the spatial domain and either non-causal Gaussian derivatives or truly time-causal temporal scale-space kernels over the temporal domain: ^[1]^[2]^[3]

[math]\displaystyle{ T(x_1, x_2, t;\; s, \tau;\; v, \Sigma) = \partial_{\varphi}^{m_1} \partial_{\bot \varphi}^{m_2} \partial_{\bar t}^n \left( g(x_1 - v_1 t, x_2 - v_2 t;\; s, \Sigma) \, h(t;\; \tau) \right) }[/math]

where

[math]\displaystyle{ x = (x_1, x_2)^T }[/math] denotes the image coordinates,
[math]\displaystyle{ t }[/math] denotes time,
[math]\displaystyle{ s }[/math] denotes the spatial scale,
[math]\displaystyle{ \tau }[/math] denotes the temporal scale,
[math]\displaystyle{ v = (v_1, v_2)^T }[/math] denotes a local image velocity,
[math]\displaystyle{ \Sigma }[/math] denotes a spatial covariance matrix determining the spatial shape of an affine Gaussian kernel,
[math]\displaystyle{ m_1 }[/math] and [math]\displaystyle{ m_2 }[/math] denotes orders of spatial differentiation,
[math]\displaystyle{ n }[/math] denotes the order of temporal differentiation,
[math]\displaystyle{ \partial_{\varphi} = \cos \varphi \, \partial_{x_1} + \sin \varphi \, \partial_{x_2} }[/math] and [math]\displaystyle{ \partial_{\bot \varphi} = \sin \varphi \, \partial_{x_1} - \cos \varphi \, \partial_{x_2} }[/math] denote spatial directional derivative operators in two orthogonal directions [math]\displaystyle{ \varphi }[/math] and [math]\displaystyle{ \bot \varphi }[/math],
[math]\displaystyle{ g(x;\; s, \Sigma) = \frac{1}{2 \pi s \sqrt{\det\Sigma}} e^{-x^T \Sigma^{-1} x/2s} }[/math] is an affine Gaussian kernel with its size determined by the spatial scale parameter [math]\displaystyle{ s }[/math] and its shape by the spatial covariance matrix [math]\displaystyle{ \Sigma }[/math],
[math]\displaystyle{ g(x_1 - v_1 t, x_2 - v_2 t;\; s, \Sigma) }[/math] denotes a spatial affine Gaussian kernel that moves with image velocity [math]\displaystyle{ v = (v_1, v_2) }[/math] in space-time and
[math]\displaystyle{ h(t;\; \tau) }[/math] is a temporal smoothing kernel over time corresponding to a Gaussian kernel in the case of non-causal time or a cascade of first-order integrators or equivalently truncated exponential kernels coupled in cascade over a time-causal temporal domain.

Correspondingly, and with similar notation idealized functional models for spatial receptive fields can be expressed of the form

[math]\displaystyle{ T(x_1, x_2;\; s, \Sigma) = \partial_{\varphi}^{m_1} \partial_{\bot \varphi}^{m_2} \left( g(x_1, x_2;\; s, \Sigma) \right). }[/math]

This model specifically generalizes the receptive field model in terms of Gaussian derivatives^[4]^[5]^[6]^[7]^[8]

[math]\displaystyle{ T(x_1, x_2;\; s) = \partial_{\varphi}^{m_1} \partial_{\bot \varphi}^{m_2} \left( g(x_1, x_2;\; s) \right) }[/math]

from directional derivatives of rotationally Gaussian kernels [math]\displaystyle{ g(x_1, x_2;\; s) }[/math] to directional derivatives of affine Gaussian kernels [math]\displaystyle{ g(x_1, x_2;\; s, \Sigma) }[/math].

Idealized functional models of receptive fields of these forms have been shown to quite well reproduce the shape of spatial and spatio-temporal receptive fields measured by cell recordings of neurons in the LGN and of simple cells in the primary visual cortex (V1).^[1]^[2]^[3]^[9]^[10]

Theoretical arguments have been presented of preferring this generalized Gaussian model of receptive fields over a Gabor model of receptive fields, because of the better theoretical properties of the generalized Gaussian model under natural image transformations.^[1]^[11] Specifically, these generalized Gaussian receptive fields can be shown to enable computation of invariant visual representations under natural image transformations.^[11] By these results, the different shapes of receptive field profiles found in biological vision, which are tuned to different sizes and orientations in the image domain as well as to different image velocities in space-time, can be seen as well adapted to structure of the physical world and be explained from the requirement that the visual system should have the possibility of being invariant to the natural types of image transformations that occur in its environment.^[1]^[2]^[11]

2. Computational Theory of Auditory Receptive Fields

A computational theory for auditory receptive fields can be expressed in a structurally similar way, permitting the derivation of auditory receptive fields in two stages:^[12]^[13]

a first stage of temporal receptive fields corresponding to an idealized cochlea model modeled as a windowed Fourier transform

[math]\displaystyle{ S(t, \omega;\; \tau) = \int_{t'=-\infty}^{\infty} f(t') \, e^{-i\omega t'} \, w(t - t';\; \tau) \, dt' }[/math]

where [math]\displaystyle{ t }[/math] denotes time, [math]\displaystyle{ \omega }[/math] denotes the angular frequency, [math]\displaystyle{ \tau }[/math] denotes the temporal scale of the window function [math]\displaystyle{ w }[/math], which can be chosen as either Gabor functions in the case of non-causal time or Gammatone functions alternatively generalized Gammatone functions for a truly time-causal model in which the future cannot be accessed,

a second layer of spectra-temporal receptive fields

[math]\displaystyle{ A_{\alpha,\beta}(t, \nu;\; \Sigma) = \partial_{t}^{\alpha} \partial_{\nu}^{\beta} \left( g(\nu - v t;\; s) \, T(t;\; \tau) \right) }[/math]

applied to the magnitude of the logarithmically transformed spectrogram

[math]\displaystyle{ S_{dB} = 20 \log_{10} \left( \frac{|S|}{S_0} \right) }[/math]

where

[math]\displaystyle{ \nu }[/math] denotes the logarithmic frequency,
[math]\displaystyle{ \Sigma }[/math] is a spectro-temporal covariance matrix determining the shape of the second-layer receptive field over the spectro-temporal domain,
[math]\displaystyle{ \alpha }[/math] is the order of temporal differentiation,
[math]\displaystyle{ \beta }[/math] is the order of logspectral differentiation,
the smoothing over the logspectral domain is modeled as a Gaussian function [math]\displaystyle{ g(\nu - v t;\; s) }[/math] extended with glissando adaptation with
a glissando parameter [math]\displaystyle{ v }[/math] to account for frequency variations over time

and with the temporal smoothing kernels [math]\displaystyle{ T(t;\; \tau) }[/math] chosen as either Gaussian kernels over time in the case of non-causal time or first-order integrators (truncated exponential kernels) coupled in cascade in the case of truly time-causal operations.

The shapes of the receptive field functions in these models can be determined by necessity from structural properties of the environment combined with requirements about the internal structure of the auditory system to enable theoretically well-founded processing of sound signals at different temporal and log-spectral scales. Specifically, the resulting spectro-temporal fields in this model obey invariance or covariance properties over natural sound transformations including: (i) temporal shifts, (ii) variations in sound pressure, (iii) the distance between the sound source and the observer, (iv) a shift in the frequencies of auditory stimuli and (v) glissando transformations.^[12]^[13]

Idealized receptive fields of this form can be shown to well model the qualitative shape of spectro-temporal receptive fields as measured by cell recordings in the inferior colliculus (ICC) as well as the linear component of some receptive fields measured in the primary auditory cortex.^[12]^[13]

References

T. Lindeberg (2013) "A computational theory of visual receptive fields", Biological Cybernetics, 107(6): 589-635. https://dx.doi.org/10.1007/s00422-013-0569-z
T. Lindeberg (2016) "Time-causal and time-recursive spatio-temporal receptive fields", Journal of Mathematical Imaging and Vision 55(1): 50-88. http://www.csc.kth.se/~tony/abstracts/Lin16-JMIV.html
T. Lindeberg (2011) "Generalized Gaussian scale-space axiomatics comprising linear scale-space, affine scale-space and spatio-temporal scale-space", Journal of Mathematical Imaging and Vision, 40(1): 36-81. http://www.csc.kth.se/~tony/abstracts/Lin10-GenGaussScSp.html
J. J. Koenderink and A. J. van Doorn (1987) "Representation of local geometry in the visual system", Biological Cybernetics 55:367–375.
R. A. Young (1987) "The Gaussian derivative model for spatial vision: I. Retinal mechanisms", Spatial Vision 2(4): 273-293.
J. J. Koenderink and A. J. van Doorn (1992) "Generic neighbourhood operators", IEEE Transactions on Pattern Analysis and Machine Intelligence, 14: 597-605.
T. Lindeberg (1993) Scale-Space Theory in Computer Vision, Springer, 1993, ISBN:0-7923-9418-6. http://www.csc.kth.se/~tony/book.html
T. Lindeberg (1994). "Scale-space theory: A basic tool for analysing structures at different scales". Journal of Applied Statistics 21 (2): pp. 224–270. doi:10.1080/757582976. http://www.csc.kth.se/~tony/abstracts/Lin94-SI-abstract.html.
G. C. DeAngelis, I. Ohzawa and R. D. Freeman (1995) "Receptive field dynamics in the central visual pathways". Trends Neurosci. 18(10), 451–457.
G. C. DeAngelis and A. Anzai (2004) "A modern view of the classical receptive field: linear and non-linear spatio-temporal processing by V1 neurons. In: Chalupa, L.M., Werner, J.S. (eds.) The Visual Neurosciences, vol. 1, pp. 704–719. MIT Press, Cambridge.
T. Lindeberg (2013) "Invariance of visual operations at the level of receptive fields", PLOS ONE 8(7): e66990, pages 1-33. https://dx.doi.org/10.1371/journal.pone.0066990
T. Lindeberg and A. Friberg (2015) "Idealized computational models of auditory receptive fields", PLOS ONE, 10(3): e0119032, pages 1-58. https://dx.doi.org/10.1371/journal.pone.0119032
T. Lindeberg and A. Friberg (2015) "Scale-space theory for auditory signals", Proc. SSVM 2015: Scale-Space and Variational Methods in Computer Vision, Springer LNCS 9087: 3-15. http://www.csc.kth.se/~tony/abstracts/LinFri15-SSVM.html

©Text is available under the terms and conditions of the Creative Commons-Attribution ShareAlike (CC BY-SA) license; additional terms may apply. By using this site, you agree to the Terms and Conditions and Privacy Policy.

Upload a video for this entry

Information

Subjects: Others

Contributor MDPI registered users' name will be linked to their SciProfiles pages. To register with us, please refer to https://encyclopedia.pub/register :

HandWiki

View Times: 804

Entry Collection: HandWiki

Update Date: 07 Nov 2022

Table of Contents

Video Upload Options

1. Computational Theory of Visual Receptive Fields

2. Computational Theory of Auditory Receptive Fields

References