Dissociating language and thought in large language models

Kyle Mahowald; Anna A Ivanova; Idan A Blank; Nancy Kanwisher; Joshua B Tenenbaum; Evelina Fedorenko

doi:10.1016/j.tics.2024.01.011

Dissociating language and thought in large language models

Trends Cogn Sci. 2024 Mar 19:S1364-6613(24)00027-5. doi: 10.1016/j.tics.2024.01.011. Online ahead of print.

Authors

Kyle Mahowald¹, Anna A Ivanova², Idan A Blank³, Nancy Kanwisher⁴, Joshua B Tenenbaum⁵, Evelina Fedorenko⁶

Affiliations

¹ The University of Texas at Austin, Austin, TX, USA. Electronic address: mahowald@utexas.edu.
² Georgia Institute of Technology, Atlanta, GA, USA. Electronic address: a.ivanova@gatech.edu.
³ University of California, Los Angeles, CA, USA. Electronic address: iblank@psych.ucla.edu.
⁴ Massachusetts Institute of Technology, Cambridge, MA, USA. Electronic address: ngk@mit.edu.
⁵ Massachusetts Institute of Technology, Cambridge, MA, USA. Electronic address: jbt@mit.edu.
⁶ Massachusetts Institute of Technology, Cambridge, MA, USA. Electronic address: evelina9@mit.edu.

PMID: 38508911
DOI: 10.1016/j.tics.2024.01.011

Abstract

Large language models (LLMs) have come closest among all models to date to mastering human language, yet opinions about their linguistic and cognitive capabilities remain split. Here, we evaluate LLMs using a distinction between formal linguistic competence (knowledge of linguistic rules and patterns) and functional linguistic competence (understanding and using language in the world). We ground this distinction in human neuroscience, which has shown that formal and functional competence rely on different neural mechanisms. Although LLMs are surprisingly good at formal competence, their performance on functional competence tasks remains spotty and often requires specialized fine-tuning and/or coupling with external modules. We posit that models that use language in human-like ways would need to master both of these competence types, which, in turn, could require the emergence of separate mechanisms specialized for formal versus functional linguistic competence.

Keywords: cognitive neuroscience; computational modeling; language and thought; large language models; linguistic competence.

Publication types

Review