A global learning with local preservation method for microarray data imputation

Comput Biol Med. 2016 Oct 1:77:76-89. doi: 10.1016/j.compbiomed.2016.08.005. Epub 2016 Aug 5.

Abstract

Microarray data suffer from missing values for various reasons, including insufficient resolution, image noise, and experimental errors. Because missing values can hinder downstream analysis steps that require complete data as input, it is crucial to be able to estimate the missing values. In this study, we propose a Global Learning with Local Preservation method (GL2P) for imputation of missing values in microarray data. GL2P consists of two components: a local similarity measurement module and a global weighted imputation module. The former uses a local structure preservation scheme to exploit as much information as possible from the observable data, and the latter is responsible for estimating the missing values of a target gene by considering all of its neighbors rather than a subset of them. Furthermore, GL2P imputes the missing values in ascending order according to the rate of missing data for each target gene to fully utilize previously estimated values. To validate the proposed method, we conducted extensive experiments on six benchmarked microarray datasets. We compared GL2P with eight state-of-the-art imputation methods in terms of four performance metrics. The experimental results indicate that GL2P outperforms its competitors in terms of imputation accuracy and better preserves the structure of differentially expressed genes. In addition, GL2P is less sensitive to the number of neighbors than other local learning-based imputation methods.
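The abstract gives no formulas, so the following is only a minimal sketch of two ideas it describes: processing target genes in ascending order of missing rate so that earlier estimates can be reused, and weighting all other genes rather than a selected subset of neighbors. The inverse root-mean-square distance used as the similarity weight, the row-mean fallback, and the function name `impute_ascending_weighted` are assumptions made for illustration. They are not the local structure preservation scheme or the regression model of GL2P.

```python
import math


def impute_ascending_weighted(matrix):
    """Fill NaN entries of a genes x samples matrix given as a list of lists.

    Illustrative only: the weights are inverse RMS distances over co-observed
    samples, which is an assumption and not the similarity measure of GL2P.
    """
    filled = [row[:] for row in matrix]  # work on a copy, leave the input intact
    n_genes = len(filled)
    missing = [[j for j, v in enumerate(row) if math.isnan(v)] for row in matrix]

    # Genes with the fewest missing values are imputed first.
    order = sorted(range(n_genes), key=lambda g: len(missing[g]))

    for g in order:
        if not missing[g]:
            continue
        target = filled[g]
        observed = [j for j, v in enumerate(target) if not math.isnan(v)]

        # Give every other gene a weight instead of picking the k nearest.
        weights = {}
        for h in range(n_genes):
            if h == g:
                continue
            shared = [j for j in observed if not math.isnan(filled[h][j])]
            if not shared:
                continue
            mean_sq = sum((target[j] - filled[h][j]) ** 2 for j in shared) / len(shared)
            weights[h] = 1.0 / (math.sqrt(mean_sq) + 1e-6)

        for j in missing[g]:
            num = den = 0.0
            for h, w in weights.items():
                value = filled[h][j]  # may be an estimate from an earlier gene
                if not math.isnan(value):
                    num += w * value
                    den += w
            if den > 0.0:
                target[j] = num / den
            elif observed:
                # Assumed fallback: mean of the gene's own observed values.
                target[j] = sum(target[k] for k in observed) / len(observed)
    return filled


nan = float("nan")
example = [
    [1.0, 2.0, 3.0],
    [1.0, 2.0, nan],
    [5.0, nan, nan],
]
completed = impute_ascending_weighted(example)
```

In this toy input the second gene is completed before the third, so the value estimated for the second gene contributes to the weighted average that fills the third gene's last sample.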

Keywords: Global learning; Local preservation; Microarray data; Missing value imputation; Regression model.

MeSH terms

  • Algorithms
  • Cluster Analysis
  • Computational Biology / methods*
  • Gene Expression Profiling / methods*
  • Humans
  • Machine Learning
  • Neoplasms / genetics
  • Neoplasms / metabolism
  • Oligonucleotide Array Sequence Analysis / methods*
  • Regression Analysis