Pathway-based biomarker identification with crosstalk analysis for robust prognosis prediction in hepatocellular carcinoma

EBioMedicine. 2019 Jun:44:250-260. doi: 10.1016/j.ebiom.2019.05.010. Epub 2019 May 14.
[Article in Spanish]

Abstract

Background: Although many prognostic single-gene (SG) lists have been identified in cancer research, application of these features is hampered due to poor robustness and performance on independent datasets. Pathway-based approaches have thus emerged which embed biological knowledge to yield reproducible features.

Methods: Pathifier estimates pathways deregulation score (PDS) to represent the extent of pathway deregulation based on expression data, and most of its applications treat pathways as independent without addressing the effect of gene overlap between pathway pairs which we refer to as crosstalk. Here, we propose a novel procedure based on Pathifier methodology, which for the first time has been utilized with crosstalk accommodated to identify disease-specific features to predict prognosis in patients with hepatocellular carcinoma (HCC).

Findings: With the cohort (N = 355) of HCC patients from The Cancer Genome Atlas (TCGA), cross validation (CV) revealed that PDSs identified were more robust and accurate than the SG features by deep learning (DL)-based approach. When validated on external HCC datasets, these features outperformed the SGs consistently.

Interpretation: On average, we provide 10.2% improvement of prediction accuracy. Importantly, governing genes in these features provide valuable insight into the cancer hallmarks of HCC. We develop an R package PATHcrosstalk (available from GitHub https://github.com/fabotao/PATHcrosstalk) with which users can discover pathways of interest with crosstalk effect considered.

Keywords: Biomarker; Crosstalk; Deep learning; Hepatocellular carcinoma; Overall survival; Pathway-based.

MeSH terms

  • Biomarkers, Tumor*
  • Carcinoma, Hepatocellular / genetics
  • Carcinoma, Hepatocellular / metabolism*
  • Carcinoma, Hepatocellular / mortality*
  • Computational Biology / methods
  • Databases, Genetic
  • Gene Expression Profiling
  • Gene Regulatory Networks
  • Humans
  • Liver Neoplasms / genetics*
  • Liver Neoplasms / metabolism*
  • Liver Neoplasms / mortality*
  • Prognosis
  • Reproducibility of Results
  • Signal Transduction*
  • Survival Analysis

Substances

  • Biomarkers, Tumor