Repository logo
 

Search Results

Now showing 1 - 8 of 8
  • An integrated framework for combining gist vision with object segregation categorisation and recognition
    Publication . Rodrigues, J. M. F.; Almeida, D.; Martins, Jaime; Lam, Roberto
    There are roughly two processing systems: (1) very fast gist vision of entire scenes, completely bottom-up and data driven, and (2) Focus-of-Attention (FoA) with sequential screening of specific image regions and objects. The latter system has to be sequential because unnormalised input objects must be matched against normalised templates of canonical object views stored in memory, which involves dynamic routing of features in the visual pathways.
  • Cortical multiscale line-edge disparity model
    Publication . Rodrigues, J. M. F.; Martins, Jaime; Lam, Roberto; du Buf, J. M. H.
    Most biological approaches to disparity extraction rely on the disparity energy model (DEM). In this paper we present an alternative approach which can complement the DEM model. This approach is based on the multiscale coding of lines and edges, because surface structures are composed of lines and edges and contours of objects often cause edges against their background. We show that the line/edge approach can be used to create a 3D wireframe representation of a scene and the objects therein. It can also significantly improve the accuracy of the DEM model, such that our biological models can compete with some state-of-the-art algorithms from computer vision.
  • A disparity energy model improved by line, edge and keypoint correspondences
    Publication . Martins, J. C.; Farrajota, Miguel; Lam, Roberto; Rodrigues, J. M. F.; Terzic, Kasim; du Buf, J. M. H.
    Disparity energy models (DEMs) estimate local depth information on the basis ofVl complex cells. Our recent DEM (Martins et al, 2011 ISSPlT261-266) employs a population code. Once the population's cells have been trained with randorn-dot stereograms, it is applied at all retinotopic positions in the visual field. Despite producing good results in textured regions, the model needs to be made more precise, especially at depth transitions.
  • Building the what and where systems: multi-scale lines, edges and keypoints
    Publication . Rodrigues, J. M. F.; Almeida, D.; Nunes, S.; Lam, Roberto; du Buf, J. M. H.
    Computer vision for realtime applications requires tremendous computational power because all images must be processed from the first to the last pixel. Ac tive vision by probing specific objects on the basis of already acquired context may lead to a significant reduction of processing. This idea is based on a few concepts from our visual cortex (Rensink, Visual Cogn. 7, 17-42, 2000): (1) our physical surround can be seen as memory, i.e. there is no need to construct detailed and complete maps, (2) the bandwidth of the what and where systems is limited, i.e. only one object can be probed at any time, and (3) bottom-up, low-level feature extraction is complemented by top-down hypothesis testing, i.e. there is a rapid convergence of activities in dendritic/axonal connections.
  • Artistic rendering of the visual cortex
    Publication . Lam, Roberto; Rodrigues, J. M. F.; du Buf, J. M. H.
    In this paper we explain the processing in the first layers of the visual cortex by simple, complex and endstopped cells, plus grouping cells for line, edge, keypoint and saliency detection. Three visualisations are presented: (a) an integrated scheme that shows activities of simple, complex and end-stopped cells, (b) artistic combinations of selected activity maps that give an impression of global image structure and/or local detail, and (c) NPR on the basis of a 2D brightness model. The cortical image representations offer many possibilities for non-photorealistic rendering.
  • Cortical 3D face and object recognition using 2D projections
    Publication . Rodrigues, J. M. F.; Lam, Roberto; du Buf, J. M. H.
    Empirical studies concerning face recognition suggest that faces may be stored in memory by a few canonical representations. In cortical area V1 exist double-opponent colour blobs, also simple, complex and end-stopped cells which provide input for a multiscale line/edge representation, keypoints for dynamic feature routine, and saliency maps for Focus-of-Attention.
  • Cortical 3D Face Recognition Framework
    Publication . Rodrigues, J. M. F.; Lam, Roberto; du Buf, J. M. H.
    Empirical studies concerning face recognition suggest that faces may be stored in memory by a few canonical representations. In cortical area V1 exist double-opponent colour blobs, also simple, complex and end-stopped cells which provide input for a multiscale line/edge representation, keypoints for dynamic routing and saliency maps for Focus-of-Attention. All these combined allow us to segregate faces. Events of different facial views are stored in memory and combined in order to identify the view and recognise the face including facial expression. In this paper we show that with five 2D views and their cortical representations it is possible to determine the left-right and frontal-lateral-profile views and to achieve view-invariant recognition of 3D faces.
  • Looking through the eyes of the painter: from visual perception to non-photorealistic rendering
    Publication . Lam, Roberto; Rodrigues, J. M. F.; du Buf, J. M. H.
    In this paper we present a brief overview of the processing in the primary visual cortex, the multi-scale line/edge and keypoint representations, and a model of brightness perception. This model, which is being extended from 1D to 2D, is based on a symbolic line and edge interpretation: lines are represented by scaled Gaussians and edges by scaled, Gaussian-windowed error functions. We show that this model, in combination with standard techniques from graphics, provides a very fertile basis for non-photorealistic image rendering.