Image descriptions in early and mid-level vision: what kind of model is this and what kind of models do we really need?

M J Dry; N Kogo; T Putzeys; J Wagemans

doi:10.1348/000712609X458053

Image descriptions in early and mid-level vision: what kind of model is this and what kind of models do we really need?

Br J Psychol. 2010 Feb;101(Pt 1):27-32; author reply 41-6. doi: 10.1348/000712609X458053. Epub 2009 Jul 3.

Authors

M J Dry¹, N Kogo, T Putzeys, J Wagemans

Affiliation

¹ Laboratory of Experimental Psychology, University of Leuven, Leuven, Belgium.

PMID: 19580700
DOI: 10.1348/000712609X458053

Abstract

In The utility of image descriptions in the initial stages of vision: A case study of printed text, Watt and Dakin (2010) describe a model that integrates mechanisms at both early and middle stages of visual processing, and provide a demonstration of the application of the model to the relational organization of printed text. In the following, we discuss a number of the merits of this approach, but argue that it is (at this stage) highly difficult to assess the utility of this model as a plausible description of human visual processing. First, we indicate that the authors' description of the model is underspecified. Second, we question the generalizability of the model. Third, we argue that the model needs to be directly compared to quantitative empirical data. Fourth, we argue that the model needs to be directly compared to alternative models.

Publication types

Comment
Research Support, Non-U.S. Gov't

MeSH terms

Computer Simulation
Humans
Image Processing, Computer-Assisted*
Models, Biological*
Photic Stimulation
Vision, Ocular / physiology*
Visual Perception