The Multilingual Picture Database

Sci Data. 2022 Jul 21;9(1):431. doi: 10.1038/s41597-022-01552-7.

Abstract

The growing interdisciplinary field of psycholinguistics is in constant need of new and up-to-date tools that allow researchers to answer complex questions and to extend their work to languages other than English, which dominates the field. One such type of tool is the picture dataset, which provides naming norms for everyday objects. However, existing databases tend to include a small number of items and have been normed in a limited number of languages, despite the recent boom in multilingualism research. In this paper we present the Multilingual Picture (Multipic) database, containing naming norms and familiarity scores for 500 coloured pictures in thirty-two languages or language varieties from around the world. The data was validated with standard methods that have been used for existing picture datasets. This is the first dataset to provide naming norms and translation equivalents for such a variety of languages; as such, it will be of particular value to psycholinguists and other interested researchers. The dataset has been made freely available.

Publication types

  • Dataset

MeSH terms

  • Databases, Factual
  • Humans
  • Language
  • Multilingualism*
  • Psycholinguistics*
  • Recognition, Psychology