The Puzzle of Evaluating Moral Cognition in Artificial Agents

Madeline G Reinecke; Yiran Mao; Markus Kunesch; Edgar A Duéñez-Guzmán; Julia Haas; Joel Z Leibo

doi:10.1111/cogs.13315

Cogn Sci. 2023 Aug;47(8):e13315. doi: 10.1111/cogs.13315.

Authors

Madeline G Reinecke¹ ², Yiran Mao¹, Markus Kunesch¹, Edgar A Duéñez-Guzmán¹, Julia Haas¹, Joel Z Leibo¹

Affiliations

¹ Google DeepMind.
² Department of Psychology, Yale University.

PMID: 37555649
DOI: 10.1111/cogs.13315

Abstract

In developing artificial intelligence (AI), researchers often benchmark against human performance as a measure of progress. Is this kind of comparison possible for moral cognition? Given that human moral judgment often hinges on intangible properties like "intention," which may have no natural analog in artificial agents, it may prove difficult to design a "like-for-like" comparison between the moral behavior of artificial and human agents. What would a measure of moral behavior for both humans and AI look like? We unravel the complexity of this question by discussing examples within reinforcement learning and generative AI, and we examine how the puzzle of evaluating artificial agents' moral cognition remains open for further investigation within cognitive science.

Keywords: Artificial intelligence; Moral cognition; Multi-agent reinforcement learning.

Publication types

Letter

MeSH terms

Artificial Intelligence*
Cognition*
Humans
Judgment
Learning
Morals