A tale of two lexica: Investigating computational pressures on word representation with neural networks

Enes Avcu; Michael Hwang; Kevin Scott Brown; David W Gow

doi:10.3389/frai.2023.1062230

A tale of two lexica: Investigating computational pressures on word representation with neural networks

Front Artif Intell. 2023 Mar 27:6:1062230. doi: 10.3389/frai.2023.1062230. eCollection 2023.

Authors

Enes Avcu¹, Michael Hwang², Kevin Scott Brown³, David W Gow^{1

4

5

6}

Affiliations

¹ Department of Neurology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, United States.
² Harvard College, Boston, MA, United States.
³ Department of Pharmaceutical Sciences and School of Chemical, Biological, and Environmental Engineering, Oregon State University, Corvallis, OR, United States.
⁴ Athinoula A. Martinos Center for Biomedical Imaging Massachusetts General Hospital, Charlestown, MA, United States.
⁵ Department of Psychology, Salem State University, Salem, MA, United States.
⁶ Harvard-MIT Division of Health Sciences and Technology, Boston, MA, United States.

Abstract

Introduction: The notion of a single localized store of word representations has become increasingly less plausible as evidence has accumulated for the widely distributed neural representation of wordform grounded in motor, perceptual, and conceptual processes. Here, we attempt to combine machine learning methods and neurobiological frameworks to propose a computational model of brain systems potentially responsible for wordform representation. We tested the hypothesis that the functional specialization of word representation in the brain is driven partly by computational optimization. This hypothesis directly addresses the unique problem of mapping sound and articulation vs. mapping sound and meaning.

Results: We found that artificial neural networks trained on the mapping between sound and articulation performed poorly in recognizing the mapping between sound and meaning and vice versa. Moreover, a network trained on both tasks simultaneously could not discover the features required for efficient mapping between sound and higher-level cognitive states compared to the other two models. Furthermore, these networks developed internal representations reflecting specialized task-optimized functions without explicit training.

Discussion: Together, these findings demonstrate that different task-directed representations lead to more focused responses and better performance of a machine or algorithm and, hypothetically, the brain. Thus, we imply that the functional specialization of word representation mirrors a computational optimization strategy given the nature of the tasks that the human brain faces.

Keywords: deep learning; dorsal and ventral streams; functional segregation; mental lexicon; neural networks; word representation.

Grants and funding

R01 DC015455/DC/NIDCD NIH HHS/United States