Differentially expressed genes selection via Laplacian regularized low-rank representation method

Comput Biol Chem. 2016 Dec:65:185-192. doi: 10.1016/j.compbiolchem.2016.09.014. Epub 2016 Sep 28.

Abstract

With the rapid development of DNA microarray technology and next-generation technology, a large number of genomic data were generated. So how to extract more differentially expressed genes from genomic data has become a matter of urgency. Because Low-Rank Representation (LRR) has the high performance in studying low-dimensional subspace structures, it has attracted a chunk of attention in recent years. However, it does not take into consideration the intrinsic geometric structures in data. In this paper, a new method named Laplacian regularized Low-Rank Representation (LLRR) has been proposed and applied on genomic data, which introduces graph regularization into LRR. By taking full advantages of the graph regularization, LLRR method can capture the intrinsic non-linear geometric information among the data. The LLRR method can decomposes the observation matrix of genomic data into a low rank matrix and a sparse matrix through solving an optimization problem. Because the significant genes can be considered as sparse signals, the differentially expressed genes are viewed as the sparse perturbation signals. Therefore, the differentially expressed genes can be selected according to the sparse matrix. Finally, we use the GO tool to analyze the selected genes and compare the P-values with other methods. The results on the simulation data and two real genomic data illustrate that this method outperforms some other methods: in differentially expressed gene selection.

Keywords: Differentially expressed genes; Genomic data; Graph regularization; Low-rank representation.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Gene Expression Regulation*
  • Models, Theoretical*