Analyzing categorical time series in the presence of missing observations

Stat Med. 2021 Sep 20;40(21):4675-4690. doi: 10.1002/sim.9089. Epub 2021 Jun 4.

Abstract

In real applications, time series often exhibit missing observations such that standard analytical tools cannot be applied. While there are approaches of how to handle missing data in quantitative time series, the case of categorical time series seems not to have been treated so far. Both for the case of ordinal and nominal time series, solutions are developed that allow to analyze their marginal and serial properties in the presence of missing observations. This is achieved by adapting the concept of amplitude modulation, which allows to obtain closed-form asymptotic expressions for the derived statistics' distribution (assuming that missingness happens independently of the actual process). The proposed methods are investigated with simulations, and they are applied in a project on migraine patients, where the monitored qualitative time series are often incomplete.

Keywords: dispersion; incomplete data; nominal time series; ordinal time series; serial dependence.