Training neural networks to encode symbols enables combinatorial generalization

Philos Trans R Soc Lond B Biol Sci. 2020 Feb 3;375(1791):20190309. doi: 10.1098/rstb.2019.0309. Epub 2019 Dec 16.

Abstract

Combinatorial generalization-the ability to understand and produce novel combinations of already familiar elements-is considered to be a core capacity of the human mind and a major challenge to neural network models. A significant body of research suggests that conventional neural networks cannot solve this problem unless they are endowed with mechanisms specifically engineered for the purpose of representing symbols. In this paper, we introduce a novel way of representing symbolic structures in connectionist terms-the vectors approach to representing symbols (VARS), which allows training standard neural architectures to encode symbolic knowledge explicitly at their output layers. In two simulations, we show that neural networks not only can learn to produce VARS representations, but in doing so they achieve combinatorial generalization in their symbolic and non-symbolic output. This adds to other recent work that has shown improved combinatorial generalization under some training conditions, and raises the question of whether specific mechanisms or training routines are needed to support symbolic processing. This article is part of the theme issue 'Towards mechanistic models of meaning composition'.

Keywords: combinatorial generalization; neural networks; symbols.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computer Simulation
  • Humans
  • Learning
  • Neural Networks, Computer*
  • Symbolism*

Associated data

  • figshare/10.6084/m9.figshare.c.4723922