Semi-Supervised Maximum Discriminative Local Margin for Gene Selection

Sci Rep. 2018 Jun 5;8(1):8619. doi: 10.1038/s41598-018-26806-6.

Abstract

In the present study, we introduce a novel semi-supervised method called the semi-supervised maximum discriminative local margin (semiMM) for gene selection in expression data. The semiMM is a "filter" approach that exploits local structure, variance, and mutual information. We first constructed a local nearest neighbour graph and divided this information into within-class and between-class local nearest neighbour graphs by weighing the edge between the two data points. The semiMM aims to discover the most discriminative features for classification via maximizing the local margin between the within-class and between-class data, the variance of all data, and the mutual information of features with class labels. Experiments on five publicly available gene expression datasets revealed the effectiveness of the proposed method compared to three state-of-the-art feature selection algorithms.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Gene Expression Profiling / methods*
  • Genetic Predisposition to Disease*
  • Genetic Testing / methods*
  • Genome-Wide Association Study / methods*
  • Humans