DeepHisCoM: deep learning pathway analysis using hierarchical structural component models

Brief Bioinform. 2022 Sep 20;23(5):bbac171. doi: 10.1093/bib/bbac171.

Abstract

Many statistical methods for pathway analysis have been used to identify pathways associated with the disease along with biological factors such as genes and proteins. However, most pathway analysis methods neglect the complex nonlinear relationship between biological factors and pathways. In this study, we propose a Deep-learning pathway analysis using Hierarchical structured CoMponent models (DeepHisCoM) that utilize deep learning to consider a nonlinear complex contribution of biological factors to pathways by constructing a multilayered model which accounts for hierarchical biological structure. Through simulation studies, DeepHisCoM was shown to have a higher power in the nonlinear pathway effect and comparable power for the linear pathway effect when compared to the conventional pathway methods. Application to hepatocellular carcinoma (HCC) omics datasets, including metabolomic, transcriptomic and metagenomic datasets, demonstrated that DeepHisCoM successfully identified three well-known pathways that are highly associated with HCC, such as lysine degradation, valine, leucine and isoleucine biosynthesis and phenylalanine, tyrosine and tryptophan. Application to the coronavirus disease-2019 (COVID-19) single-nucleotide polymorphism (SNP) dataset also showed that DeepHisCoM identified four pathways that are highly associated with the severity of COVID-19, such as mitogen-activated protein kinase (MAPK) signaling pathway, gonadotropin-releasing hormone (GnRH) signaling pathway, hypertrophic cardiomyopathy and dilated cardiomyopathy. Codes are available at https://github.com/chanwoo-park-official/DeepHisCoM.

Keywords: deep-learning; pathway analysis.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biological Factors
  • COVID-19*
  • Carcinoma, Hepatocellular* / genetics
  • Deep Learning*
  • Gonadotropin-Releasing Hormone
  • Humans
  • Isoleucine
  • Leucine
  • Liver Neoplasms*
  • Lysine
  • Mitogen-Activated Protein Kinases
  • Phenylalanine
  • Tryptophan
  • Tyrosine
  • Valine

Substances

  • Biological Factors
  • Gonadotropin-Releasing Hormone
  • Isoleucine
  • Leucine
  • Lysine
  • Mitogen-Activated Protein Kinases
  • Phenylalanine
  • Tryptophan
  • Tyrosine
  • Valine