Brief Survey on Machine Learning in Epistasis

Methods Mol Biol. 2021:2212:169-179. doi: 10.1007/978-1-0716-0947-7_11.

Abstract

In biology, the term "epistasis" indicates the effect of the interaction of a gene with another gene. A gene can interact with an independently sorted gene, located far away on the chromosome or on an entirely different chromosome, and this interaction can have a strong effect on the function of the two genes. These changes then can alter the consequences of the biological processes, influencing the organism's phenotype. Machine learning is an area of computer science that develops statistical methods able to recognize patterns from data. A typical machine learning algorithm consists of a training phase, where the model learns to recognize specific trends in the data, and a test phase, where the trained model applies its learned intelligence to recognize trends in external data. Scientists have applied machine learning to epistasis problems multiple times, especially to identify gene-gene interactions from genome-wide association study (GWAS) data. In this brief survey, we report and describe the main scientific articles published in data mining and epistasis. Our article confirms the effectiveness of machine learning in this genetics subfield.

Keywords: Epistasis; Gene–gene interactions; Machine learning; Overview; Review; Survey.

Publication types

  • Review

MeSH terms

  • Alzheimer Disease / genetics
  • Alzheimer Disease / metabolism
  • Alzheimer Disease / pathology
  • Computational Biology / methods*
  • Crohn Disease / genetics
  • Crohn Disease / metabolism
  • Crohn Disease / pathology
  • Data Mining / methods*
  • Epistasis, Genetic*
  • Genome, Human
  • Genome-Wide Association Study
  • Humans
  • Inheritance Patterns
  • Machine Learning*
  • Macular Degeneration / genetics
  • Macular Degeneration / metabolism
  • Macular Degeneration / pathology
  • Neoplasms / genetics
  • Neoplasms / metabolism
  • Neoplasms / pathology
  • Phenotype
  • Plants / genetics
  • Polymorphism, Single Nucleotide
  • Quantitative Trait, Heritable*