AI Psychometrics: Assessing the Psychological Profiles of Large Language Models Through Psychometric Inventories

Max Pellert; Clemens M Lechner; Claudia Wagner; Beatrice Rammstedt; Markus Strohmaier

doi:10.1177/17456916231214460

Perspect Psychol Sci. 2024 Jan 2:17456916231214460. doi: 10.1177/17456916231214460. Online ahead of print.

Authors

Max Pellert¹, Clemens M Lechner², Claudia Wagner²,³,⁴, Beatrice Rammstedt², Markus Strohmaier¹,²,⁴

Affiliations

¹ Business School, University of Mannheim.
² GESIS-Leibniz Institute for the Social Sciences.
³ Department of Society, Technology and Human Factors, RWTH Aachen University.
⁴ Complexity Science Hub Vienna, Vienna, Austria.

PMID: 38165766
DOI: 10.1177/17456916231214460

Abstract

We illustrate how standard psychometric inventories originally designed for assessing noncognitive human traits can be repurposed as diagnostic tools to evaluate analogous traits in large language models (LLMs). We start from the assumption that LLMs, inadvertently yet inevitably, acquire psychological traits (metaphorically speaking) from the vast text corpora on which they are trained. Such corpora contain sediments of the personalities, values, beliefs, and biases of the countless human authors of these texts, which LLMs learn through a complex training process. The traits that LLMs acquire in such a way can potentially influence their behavior, that is, their outputs in downstream tasks and applications in which they are employed, which in turn may have real-world consequences for individuals and social groups. By eliciting LLMs' responses to language-based psychometric inventories, we can bring their traits to light. Psychometric profiling enables researchers to study and compare LLMs in terms of noncognitive characteristics, thereby providing a window into the personalities, values, beliefs, and biases these models exhibit (or mimic). We discuss the history of similar ideas and outline possible psychometric approaches for LLMs. We demonstrate one promising approach, zero-shot classification, for several LLMs and psychometric inventories. We conclude by highlighting open challenges and future avenues of research for AI Psychometrics.
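The abstract names zero-shot classification as the demonstrated approach but does not spell out how classifier output becomes an inventory score. The following is a minimal sketch of one plausible scoring step, not the authors' pipeline. It assumes each inventory item has already been passed to a zero-shot classifier with the response options as candidate labels, yielding one logit per option. The option wording, the function names (`softmax`, `item_score`, `scale_scores`), the trait label, and the logits in `demo` are all hypothetical and chosen for illustration only.

```python
import math

# Hypothetical 5-point Likert response options, assumed to have been used
# as candidate labels for the zero-shot classifier (ordered low to high).
OPTIONS = ["strongly disagree", "disagree", "neutral", "agree", "strongly agree"]


def softmax(logits):
    """Convert per-option logits into probabilities that sum to 1."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def item_score(option_logits, reverse_keyed=False):
    """Expected Likert score (1..k) for one item from per-option logits."""
    probs = softmax(option_logits)
    score = sum((i + 1) * p for i, p in enumerate(probs))
    # Reverse-keyed items are flipped so that high always means "more trait".
    return (len(probs) + 1 - score) if reverse_keyed else score


def scale_scores(items):
    """Average item scores per trait.

    items: iterable of (trait, option_logits, reverse_keyed) tuples.
    """
    by_trait = {}
    for trait, logits, rev in items:
        by_trait.setdefault(trait, []).append(item_score(logits, rev))
    return {t: sum(v) / len(v) for t, v in by_trait.items()}


# Made-up logits for illustration; these are not results from any model.
demo = [
    ("extraversion", [0.0, 0.0, 0.0, 0.0, 0.0], False),
    ("extraversion", [0.0, 0.0, 0.0, 0.0, 0.0], True),
]
print(scale_scores(demo))
```

Taking the expected value over all options, rather than only the top-ranked option, keeps the classifier's uncertainty in the score; either choice is a design decision that a real study would need to justify.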

Keywords: artificial intelligence; gender/sex diversity beliefs; large language model; moral foundations; natural language inference; natural language processing; personality; psychometrics; values.