Loading...
Research Project
FACE AND OBJECT RECOGNITION BY 3D CORTICAL REPRESENTATIONS
Funder
Authors
Publications
Face and object recognition by 3D cortical representations
Publication . Martins, Jaime Afonso do Nascimento Carvalho; du Buf, J.M.H.; Rodrigues, J.M.F.
This thesis presents a novel integrated cortical architecture with significant
emphasis on low-level attentional mechanisms—based on retinal nonstandard
cells and pathways—that can group non-attentional, bottom-up
features present in V1/V2 into “proto-object” shapes. These shapes are extracted
at first using combinations of specific cell types for detecting corners,
bars/edges and curves which work extremely well for geometrically
shaped objects. Later, in the parietal pathway (probably in LIP), arbitrary
shapes can be extracted from population codes of V2 (or even dorsal V3)
oriented cells that encode the outlines of objects as “proto-objects”. Object
shapes obtained at both cortical levels play an important role in bottom-up
local object gist vision, which tries to understand scene context in less than
70 ms and is thought to use both global and local scene features.
Edge conspicuity maps are able to detect borders/edges of objects and
attribute them a weight based on their perceptual salience, using readily
available retinal ganglion cell colour-opponency coding. Conspicuity maps
are fundamental in building posterior saliency maps—important for both
bottom-up attention schemes and also for Focus-of-Attention mechanisms
that control eye gaze and object recognition.
Disparity maps are also a main focus of this thesis. They are built upon
binocular simple and complex cells in quadrature, using a Disparity-Enery
Model. These maps are fundamental for perception of distance within a
scene and close/far object relationships in doing foreground to background
segregation.
The role of cortical disparity in 3D facial recognition was also explored
when processing faces with very different facial expressions (even extreme
ones), yielding state-of-the-art results when compared to other, non-biological,
computer vision algorithms.
Cortical multiscale line-edge disparity model
Publication . Rodrigues, J. M. F.; Martins, Jaime; Lam, Roberto; du Buf, J. M. H.
Most biological approaches to disparity extraction rely on
the disparity energy model (DEM). In this paper we present an alternative
approach which can complement the DEM model. This approach
is based on the multiscale coding of lines and edges, because surface
structures are composed of lines and edges and contours of objects often
cause edges against their background. We show that the line/edge approach
can be used to create a 3D wireframe representation of a scene
and the objects therein. It can also significantly improve the accuracy of
the DEM model, such that our biological models can compete with some
state-of-the-art algorithms from computer vision.
Luminance, colour, viewpoint and border enhanced disparity energy model
Publication . Martins, Jaime; Rodrigues, Joao; du Buf, J. M. H.
The visual cortex is able to extract disparity information through the use of binocular cells. This process is reflected by the Disparity Energy Model, which describes the role and functioning of simple and complex binocular neuron populations, and how they are able to extract disparity. This model uses explicit cell parameters to mathematically determine preferred cell disparities, like spatial frequencies, orientations, binocular phases and receptive field positions. However, the brain cannot access such explicit cell parameters; it must rely on cell responses. In this article, we implemented a trained binocular neuronal population, which encodes disparity information implicitly. This allows the population to learn how to decode disparities, in a similar way to how our visual system could have developed this ability during evolution. At the same time, responses of monocular simple and complex cells can also encode line and edge information, which is useful for refining disparities at object borders. The brain should then be able, starting from a low-level disparity draft, to integrate all information, including colour and viewpoint perspective, in order to propagate better estimates to higher cortical areas.
Organizational Units
Description
Keywords
Contributors
Funders
Funding agency
Fundação para a Ciência e a Tecnologia
Funding programme
Funding Award Number
SFRH/BD/44941/2008