Feature selection methods for big data bioinformatics: A survey from the search perspective

Methods. 2016 Dec 1:111:21-31. doi: 10.1016/j.ymeth.2016.08.014. Epub 2016 Aug 31.

Abstract

This paper surveys main principles of feature selection and their recent applications in big data bioinformatics. Instead of the commonly used categorization into filter, wrapper, and embedded approaches to feature selection, we formulate feature selection as a combinatorial optimization or search problem and categorize feature selection methods into exhaustive search, heuristic search, and hybrid methods, where heuristic search methods may further be categorized into those with or without data-distilled feature ranking measures.

Keywords: Biomarkers; Classification; Clustering; Computational biology; Computational intelligence; Data mining; Evolutionary algorithms; Evolutionary computation; Fuzzy logic; Genetic algorithms; Machine learning; Microarray; Neural networks; Particle swarm optimization; Pattern recognition; Random forests; Rough sets; Soft computing; Support vector machines; Swarm intelligence.

Publication types

  • Review

MeSH terms

  • Algorithms
  • Artificial Intelligence
  • Computational Biology / methods*
  • Computational Biology / trends
  • Data Mining / methods*
  • Data Mining / trends
  • Humans
  • Software*