A Latent Hidden Markov Model for Process Data

Xueying Tang

doi:10.1007/s11336-023-09938-1

A Latent Hidden Markov Model for Process Data

Psychometrika. 2024 Mar;89(1):205-240. doi: 10.1007/s11336-023-09938-1. Epub 2023 Nov 7.

Author

Xueying Tang¹

Affiliation

¹ University of Arizona, 617 N. Santa Rita Ave., Tucson, AZ , 85721, USA. xytang@math.arizona.edu.

PMID: 37934358
DOI: 10.1007/s11336-023-09938-1

Abstract

Response process data from computer-based problem-solving items describe respondents' problem-solving processes as sequences of actions. Such data provide a valuable source for understanding respondents' problem-solving behaviors. Recently, data-driven feature extraction methods have been developed to compress the information in unstructured process data into relatively low-dimensional features. Although the extracted features can be used as covariates in regression or other models to understand respondents' response behaviors, the results are often not easy to interpret since the relationship between the extracted features, and the original response process is often not explicitly defined. In this paper, we propose a statistical model for describing response processes and how they vary across respondents. The proposed model assumes a response process follows a hidden Markov model given the respondent's latent traits. The structure of hidden Markov models resembles problem-solving processes, with the hidden states interpreted as problem-solving subtasks or stages. Incorporating the latent traits in hidden Markov models enables us to characterize the heterogeneity of response processes across respondents in a parsimonious and interpretable way. We demonstrate the performance of the proposed model through simulation experiments and case studies of PISA process data.

Keywords: hidden Markov models; latent variable; problem-solving behaviors; response process.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Computer Simulation
Humans
Markov Chains*
Models, Statistical*
Problem Solving*
Psychometrics* / methods

Grants and funding

DMS-2310664/National Science Foundation