Brain-inspired model for early vocal learning and correspondence matching using free-energy optimization

Alexandre Pitti; Mathias Quoy; Sofiane Boucenna; Catherine Lavandier

doi:10.1371/journal.pcbi.1008566

Brain-inspired model for early vocal learning and correspondence matching using free-energy optimization

PLoS Comput Biol. 2021 Feb 18;17(2):e1008566. doi: 10.1371/journal.pcbi.1008566. eCollection 2021 Feb.

Authors

Alexandre Pitti¹, Mathias Quoy¹, Sofiane Boucenna¹, Catherine Lavandier¹

Affiliation

¹ Laboratoire ETIS, CY Cergy Paris University, ENSEA, CNRS, UMR8051, Cergy, France.

Abstract

We propose a developmental model inspired by the cortico-basal system (CX-BG) for vocal learning in babies and for solving the correspondence mismatch problem they face when they hear unfamiliar voices, with different tones and pitches. This model is based on the neural architecture INFERNO standing for Iterative Free-Energy Optimization of Recurrent Neural Networks. Free-energy minimization is used for rapidly exploring, selecting and learning the optimal choices of actions to perform (eg sound production) in order to reproduce and control as accurately as possible the spike trains representing desired perceptions (eg sound categories). We detail in this paper the CX-BG system responsible for linking causally the sound and motor primitives at the order of a few milliseconds. Two experiments performed with a small and a large audio database show the capabilities of exploration, generalization and robustness to noise of our neural architecture in retrieving audio primitives during vocal learning and during acoustic matching with unheared voices (different genders and tones).

MeSH terms

Algorithms
Auditory Cortex / physiology
Auditory Perception / physiology
Basal Ganglia / physiology
Brain / physiology*
Child Development / physiology
Computational Biology
Female
Humans
Infant
Language Development
Learning / physiology*
Male
Models, Neurological*
Models, Psychological
Nerve Net / physiology
Neural Networks, Computer
Unsupervised Machine Learning
Verbal Behavior / physiology*

Grants and funding

The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.