Detecting structured repetition in child-surrounding speech: Evidence from maximally diverse languages

Cognition. 2022 Apr:221:104986. doi: 10.1016/j.cognition.2021.104986. Epub 2021 Dec 23.

Abstract

Caretakers tend to repeat themselves when speaking to children, either to clarify their message or to redirect wandering attention. This repetition also appears to support language learning. For example, words that are heard more frequently tend to be produced earlier by young children. However, pure repetition only goes so far; some variation between utterances is necessary to support acquisition of a fully productive grammar. When individual words or morphemes are repeated, but embedded in different lexical and syntactic contexts, the child has more information about how these forms may be used and combined. Corpus analysis has shown that these partial repetitions frequently occur in clusters, which have been coined variation sets. More recent research has introduced algorithms that can extract these variation sets automatically from corpora with the goal of measuring their relative prevalence across ages and languages. Longitudinal analyses have revealed that rates of variation sets tend to decrease as children get older. We extend this research in several ways. First, we consider a maximally diverse sample of languages, both genealogically and geographically, to test the generalizability of developmental trends. Second, we compare multiple levels of repetition, both words and morphemes, to account for typological differences in how information is encoded. Third, we consider several additional measures of development to account for deficiencies in age as a measure of linguistic aptitude. Fourth, we examine whether the levels of repetition found in child-surrounding speech is greater or less than what would have been expected by chance. This analysis produced a new measure, redundancy, which captures how repetitive speech is on average given how repeititive it could have been. Fifth, we compare rates of repetition in child-surrounding and adult-directed speech to test whether variation sets are especially prevalent in child-surrounding speech. We find that (1) some languages show increases in repetition over development, (2) true estimates of variation sets are generally lower than or equal to random baselines, (3) these patterns are largely convergent across developmental indices, and (4) adult-directed speech is reliably less redundant, though in some cases more repetitive, than child-surrounding speech. These results are discussed with respect to features of the corpora, typological properties of the languages, and differential rates of change in repetition and redundancy over children's development.

Keywords: Child-directed speech; Cross-linguistic language acquisition; Input patterns; Variation sets.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Child, Preschool
  • Humans
  • Language Development
  • Language*
  • Linguistics
  • Speech*