Multiple imputation of maritime search and rescue data at multiple missing patterns

PLoS One. 2021 Jun 18;16(6):e0252129. doi: 10.1371/journal.pone.0252129. eCollection 2021.

Abstract

Based on the missing situation and actual needs of maritime search and rescue data, multiple imputation methods were used to construct complete data sets under different missing patterns. Probability density curves and overimputation diagnostics were used to explore the effects of multiple imputation. The results showed that the Data Augmentation (DA) algorithm had the characteristics of high operation efficiency and good imputation effect, but the algorithm was not suitable for data imputation when there was a high data missing rate. The EMB algorithm effectively restored the distribution of datasets with different data missing rates, and was less affected by the missing position; the EMB algorithm could obtain a good imputation effect even when there was a high data missing rate. Overimputation diagnostics could not only reflect the data imputation effect, but also show the correlation between different datasets, which was of great importance for deep data mining and imputation effect improvement. The Expectation-Maximization with Bootstrap (EMB) algorithm had a poor estimation effect on extreme data and failed to reflect the dataset's variability characteristics.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Accidents / statistics & numerical data*
  • Algorithms*
  • Computer Simulation*
  • Data Interpretation, Statistical*
  • Humans
  • Models, Statistical*
  • Rescue Work / methods*

Grants and funding

This work was supported by the [National Science and Technology Support Program] under Grant [2015BAG20B01]; [National Key R&D Program of China] under Grant [2017YFC1404705].