The Puzzle of Evaluating Moral Cognition in Artificial Agents

Cogn Sci. 2023 Aug;47(8):e13315. doi: 10.1111/cogs.13315.

Abstract

In developing artificial intelligence (AI), researchers often benchmark against human performance as a measure of progress. Is this kind of comparison possible for moral cognition? Given that human moral judgment often hinges on intangible properties like "intention" which may have no natural analog in artificial agents, it may prove difficult to design a "like-for-like" comparison between the moral behavior of artificial and human agents. What would a measure of moral behavior for both humans and AI look like? We unravel the complexity of this question by discussing examples within reinforcement learning and generative AI, and we examine how the puzzle of evaluating artificial agents' moral cognition remains open for further investigation within cognitive science.

Keywords: Artificial intelligence; Moral cognition; Multi-agent reinforcement learning.

Publication types

  • Letter

MeSH terms

  • Artificial Intelligence*
  • Cognition*
  • Humans
  • Judgment
  • Learning
  • Morals