Computational frameworks integrating deep learning and statistical models in mining multimodal omics data

J Biomed Inform. 2024 Apr:152:104629. doi: 10.1016/j.jbi.2024.104629. Epub 2024 Mar 28.

Abstract

Background: In health research, multimodal omics data analysis is widely used to address important clinical and biological questions. Traditional statistical methods rely on the strong assumptions of distribution. Statistical methods such as testing and differential expression are commonly used in omics analysis. Deep learning, on the other hand, is an advanced computer science technique that is powerful in mining high-dimensional omics data for prediction tasks. Recently, integrative frameworks or methods have been developed for omics studies that combine statistical models and deep learning algorithms.

Methods and results: The aim of these integrative frameworks is to combine the strengths of both statistical methods and deep learning algorithms to improve prediction accuracy while also providing interpretability and explainability. This review report discusses the current state-of-the-art integrative frameworks, their limitations, and potential future directions in survival and time-to-event longitudinal analysis, dimension reduction and clustering, regression and classification, feature selection, and causal and transfer learning.

Keywords: Deep learning; End-to-end; Integrative framework; Multi-stage; Multimodal omics; Statistical methods.

Publication types

  • Review
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Computational Biology / methods
  • Deep Learning*
  • Genomics* / methods
  • Models, Statistical

Grants and funding