Machine Learning in Epigenomics: Insights into Cancer Biology and Medicine

Biochim Biophys Acta Rev Cancer. 2021 Dec;1876(2):188588. doi: 10.1016/j.bbcan.2021.188588. Epub 2021 Jul 7.

Abstract

The recent deluge of genome-wide technologies for the mapping of the epigenome and resulting data in cancer samples has provided the opportunity for gaining insights into and understanding the roles of epigenetic processes in cancer. However, the complexity, high-dimensionality, sparsity, and noise associated with these data pose challenges for extensive integrative analyses. Machine Learning (ML) algorithms are particularly suited for epigenomic data analyses due to their flexibility and ability to learn underlying hidden structures. We will discuss four overlapping but distinct major categories under ML: dimensionality reduction, unsupervised methods, supervised methods, and deep learning (DL). We review the preferred use cases of these algorithms in analyses of cancer epigenomics data with the hope to provide an overview of how ML approaches can be used to explore fundamental questions on the roles of epigenome in cancer biology and medicine.

Keywords: Cancer; Chromatin; Deep learning; Epigenomics; Machine learning.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Review

MeSH terms

  • Biology / methods*
  • Epigenomics / methods*
  • Humans
  • Machine Learning / standards*
  • Medicine / methods*