Epistasis Analysis: Classification Through Machine Learning Methods

Methods Mol Biol. 2021:2212:337-345. doi: 10.1007/978-1-0716-0947-7_21.

Abstract

Complex disease is different from Mendelian disorders. Its development usually involves the interaction of multiple genes or the interaction between genes and the environment (i.e. epistasis). Although the high-throughput sequencing technologies for complex diseases have produced a large amount of data, it is extremely difficult to analyze the data due to the high feature dimension and the combination in the epistasis analysis. In this work, we introduce machine learning methods to effectively reduce the gene dimensionality, retain the key epistatic effects, and effectively characterize the relationship between epistatic effects and complex diseases.

Keywords: Classification; Epistasis; Feature selection; Machine learning; Model evaluation.

MeSH terms

  • Computational Biology / methods
  • Datasets as Topic
  • Epistasis, Genetic*
  • Humans
  • Machine Learning*
  • Models, Genetic*
  • Multifactor Dimensionality Reduction
  • Multifactorial Inheritance / genetics*
  • Polymorphism, Single Nucleotide*
  • Software