PhonItalia: a phonological lexicon for Italian

Behav Res Methods. 2014 Sep;46(3):872-86. doi: 10.3758/s13428-013-0400-8.

Abstract

In this article, we present the first open-access lexical database that provides phonological representations for 120,000 Italian word forms. Each of these also includes syllable boundaries and stress markings and a comprehensive range of lexical statistics. Using data derived from this lexicon, we have also generated a set of derived databases and provided estimates of positional frequency use for Italian phonemes, syllables, syllable onsets and codas, and character and phoneme bigrams. These databases are freely available from phonitalia.org. This article describes the methods, content, and summarizing statistics for these databases. In a first application of this database, we also demonstrate how the distribution of phonological substitution errors made by Italian aphasic patients is related to phoneme frequency.

MeSH terms

  • Aphasia / physiopathology*
  • Databases, Factual
  • Humans
  • Internet
  • Italy
  • Language*
  • Linear Models
  • Phonation
  • Phonetics
  • Psycholinguistics / methods*
  • Reproducibility of Results