Two Computational Approaches to Visual Analogy: Task-Specific Models Versus Domain-General Mapping

Cogn Sci. 2023 Sep;47(9):e13347. doi: 10.1111/cogs.13347.

Abstract

Advances in artificial intelligence have raised a basic question about human intelligence: Is human reasoning best emulated by applying task-specific knowledge acquired from a wealth of prior experience, or is it based on the domain-general manipulation and comparison of mental representations? We address this question for the case of visual analogical reasoning. Using realistic images of familiar three-dimensional objects (cars and their parts), we systematically manipulated viewpoints, part relations, and entity properties in visual analogy problems. We compared human performance to that of two recent deep learning models (Siamese Network and Relation Network) that were directly trained to solve these problems and to apply their task-specific knowledge to analogical reasoning. We also developed a new model using part-based comparison (PCM) by applying a domain-general mapping procedure to learned representations of cars and their component parts. Across four-term analogies (Experiment 1) and open-ended analogies (Experiment 2), the domain-general PCM model, but not the task-specific deep learning models, generated performance similar in key aspects to that of human reasoners. These findings provide evidence that human-like analogical reasoning is unlikely to be achieved by applying deep learning with big data to a specific type of analogy problem. Rather, humans do (and machines might) achieve analogical reasoning by learning representations that encode structural information useful for multiple tasks, coupled with efficient computation of relational similarity.
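To make the contrast between the two approaches concrete, the sketch below illustrates one way a domain-general mapping over part-based representations could score a four-term visual analogy. It is a minimal, hypothetical illustration assuming learned part embeddings and difference-vector relations compared by cosine similarity; the function names and the toy data are assumptions for exposition, not the PCM model or the deep learning baselines described in the paper.

```python
import numpy as np

def cosine(u, v):
    """Cosine similarity between two vectors."""
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v) + 1e-12))

def relational_similarity(a_parts, b_parts, c_parts, d_parts):
    """Score an A:B :: C:D analogy from part embeddings.

    Each argument maps a part name (e.g., 'wheel', 'door') to an
    embedding vector. The A->B relation for each shared part is taken
    as a difference vector, likewise for C->D, and the analogy score is
    the mean cosine similarity between corresponding relation vectors.
    This is an illustrative stand-in for a domain-general mapping
    procedure, not the paper's implementation.
    """
    shared = set(a_parts) & set(b_parts) & set(c_parts) & set(d_parts)
    if not shared:
        return 0.0
    sims = []
    for part in shared:
        rel_ab = b_parts[part] - a_parts[part]   # source relation
        rel_cd = d_parts[part] - c_parts[part]   # target relation
        sims.append(cosine(rel_ab, rel_cd))
    return float(np.mean(sims))

# Toy usage: choose the candidate D that best completes A:B :: C:?
rng = np.random.default_rng(0)
parts = ["wheel", "door", "hood"]
A = {p: rng.normal(size=8) for p in parts}
shift = rng.normal(size=8)                       # shared transformation
B = {p: A[p] + shift for p in parts}
C = {p: rng.normal(size=8) for p in parts}
candidates = {"analogous": {p: C[p] + shift for p in parts},
              "distractor": {p: rng.normal(size=8) for p in parts}}
best = max(candidates, key=lambda k: relational_similarity(A, B, C, candidates[k]))
print(best)  # expected: 'analogous'
```

In this toy setup the analogous candidate is preferred because its part-wise relation vectors align with those of the source pair, whereas a task-specific model would instead have to learn the choice behavior directly from many training problems.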

Keywords: Analogy; Computational modeling; Deep learning; Relations; Visual reasoning.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Artificial Intelligence*
  • Humans
  • Intelligence*
  • Knowledge
  • Problem Solving