Multivariate Time Series Imputation: An Approach Based on Dictionary Learning

Entropy (Basel). 2022 Jul 31;24(8):1057. doi: 10.3390/e24081057.

Abstract

The problem addressed by dictionary learning (DL) is the representation of data as a sparse linear combination of the columns of a matrix called the dictionary. Both the dictionary and the sparse representations are learned from the data. We show how DL can be employed in the imputation of multivariate time series. We use a structured dictionary, which comprises one block for each time series and a common block shared by all the time series. The size of each block and the sparsity level of the representation are selected by using information theoretic criteria. The objective function used in learning is designed to minimize either the sum of the squared errors or the sum of the magnitudes of the errors. We propose dimensionality reduction techniques for the case of high-dimensional time series. To demonstrate how the new algorithms can be used in practical applications, we conduct a large set of experiments on five real-life data sets. The missing data (MD) are simulated according to various scenarios in which both the percentage of MD and the length of the MD sequences are varied. This allows us to identify the situations in which the novel DL-based methods are superior to the existing methods.
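To make the general idea concrete, the sketch below illustrates imputation via sparse coding over a learned dictionary: a dictionary is learned on complete signal windows, a window with missing entries is sparse-coded using only its observed coordinates, and the missing entries are read off the reconstruction. This is a minimal, generic illustration only, not the paper's structured (per-series plus common block) dictionary nor its information theoretic model selection; the toy data, window size, number of atoms, and sparsity level are all illustrative assumptions.

```python
# Minimal sketch: imputation by sparse coding over a learned dictionary.
# Illustrative only; the paper's structured dictionary and information
# theoretic selection of block sizes / sparsity are NOT implemented here.
import numpy as np
from sklearn.decomposition import DictionaryLearning
from sklearn.linear_model import OrthogonalMatchingPursuit

rng = np.random.default_rng(0)

# Toy multivariate series: T samples, m variables, cut into overlapping windows.
T, m, w = 300, 8, 10
series = np.cumsum(rng.standard_normal((T, m)), axis=0)
windows = np.stack([series[t:t + w].ravel() for t in range(0, T - w, 5)])  # (n, w*m)

# Learn a dictionary on complete windows (atoms are rows of components_).
K, s = 30, 5  # number of atoms and sparsity level, chosen ad hoc for the demo
dl = DictionaryLearning(n_components=K, transform_algorithm='omp',
                        transform_n_nonzero_coefs=s, random_state=0)
dl.fit(windows)
D = dl.components_.T  # shape (w*m, K): columns are the dictionary atoms

# Impute one window: sparse-code on its observed rows only, then
# reconstruct the missing rows from the same sparse code.
y = windows[3].copy()
mask = rng.random(y.size) < 0.2          # ~20% of the entries missing at random
y_obs, D_obs = y[~mask], D[~mask]

omp = OrthogonalMatchingPursuit(n_nonzero_coefs=s, fit_intercept=False)
omp.fit(D_obs, y_obs)                    # min ||y_obs - D_obs x||_2 s.t. ||x||_0 <= s
x = omp.coef_

y_hat = D @ x                            # full reconstruction, including missing rows
rmse = np.sqrt(np.mean((y_hat[mask] - y[mask]) ** 2))
print(f"RMSE on the missing entries: {rmse:.3f}")
```

In this sketch the squared-error criterion is used implicitly through OMP; the abstract's alternative objective (sum of the magnitudes of the errors) would replace the least-squares fit with a robust, L1-type fit.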

Keywords: dictionary learning; imputation; information theoretic criteria; missing data; multivariate time series.