ProcData: An R Package for Process Data Analysis

Psychometrika. 2021 Dec;86(4):1058-1083. doi: 10.1007/s11336-021-09798-7. Epub 2021 Aug 11.

Abstract

Process data refer to data recorded in log files of computer-based items. These data, represented as timestamped action sequences, keep track of respondents' response problem-solving behaviors. Process data analysis aims at enhancing educational assessment accuracy and serving other assessment purposes by utilizing the rich information contained in response processes. The R package ProcData presented in this article is designed to provide tools for inspecting, processing, and analyzing process data. We define an S3 class 'proc' for organizing process data and extend generic methods summary and print for 'proc'. Feature extraction methods for process data are implemented in the package for compressing information in the irregular response processes into regular numeric vectors. ProcData also provides functions for making predictions from neural-network-based sequence models. In addition, a real dataset of response processes from the climate control item in the 2012 Programme for International Student Assessment is included in the package.

Keywords: autoencoder; multidimensional scaling; process data analysis; sequence model.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Computers*
  • Data Analysis*
  • Educational Measurement
  • Humans
  • Problem Solving
  • Psychometrics
  • Software