Discrete Missing Data Imputation Using Multilayer Perceptron and Momentum Gradient Descent

Sensors (Basel). 2022 Jul 28;22(15):5645. doi: 10.3390/s22155645.

Abstract

Data are a strategic resource for industrial production, and an efficient data-mining process will increase productivity. However, there exist many missing values in data collected in real life due to various problems. Because the missing data may reduce productivity, missing value imputation is an important research topic in data mining. At present, most studies mainly focus on imputation methods for continuous missing data, while a few concentrate on discrete missing data. In this paper, a discrete missing value imputation method based on a multilayer perceptron (MLP) is proposed, which employs a momentum gradient descent algorithm, and some prefilling strategies are utilized to improve the convergence speed of the MLP. To verify the effectiveness of the method, experiments are conducted to compare the classification accuracy with eight common imputation methods, such as the mode, random, hot-deck, KNN, autoencoder, and MLP, under different missing mechanisms and missing proportions. Experimental results verify that the improved MLP model (IMLP) can effectively impute discrete missing values in most situations under three missing patterns.

Keywords: data imputation; data preprocessing; discrete missing data; momentum gradient descent algorithm; multilayer perceptron.

MeSH terms

  • Algorithms*
  • Data Mining
  • Motion
  • Neural Networks, Computer*
  • Research Design