Overall survival prediction of non-small cell lung cancer by integrating microarray and clinical data with deep learning

Sci Rep. 2020 Mar 13;10(1):4679. doi: 10.1038/s41598-020-61588-w.

Abstract

Non-small cell lung cancer (NSCLC) is one of the most common lung cancers worldwide. Accurate prognostic stratification of NSCLC can become an important clinical reference when designing therapeutic strategies for cancer patients. With this clinical application in mind, we developed a deep neural network (DNN) combining heterogeneous data sources of gene expression and clinical data to accurately predict the overall survival of NSCLC patients. Based on microarray data from a cohort set (614 patients), seven well-known NSCLC biomarkers were used to group patients into biomarker- and biomarker+ subgroups. Then, by using a systems biology approach, prognosis relevance values (PRV) were then calculated to select eight additional novel prognostic gene biomarkers. Finally, the combined 15 biomarkers along with clinical data were then used to develop an integrative DNN via bimodal learning to predict the 5-year survival status of NSCLC patients with tremendously high accuracy (AUC: 0.8163, accuracy: 75.44%). Using the capability of deep learning, we believe that our prediction can be a promising index that helps oncologists and physicians develop personalized therapy and build the foundation of precision medicine in the future.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Area Under Curve
  • Biomarkers, Tumor
  • Carcinoma, Non-Small-Cell Lung / diagnosis*
  • Carcinoma, Non-Small-Cell Lung / etiology
  • Carcinoma, Non-Small-Cell Lung / mortality*
  • Computational Biology* / methods
  • Deep Learning*
  • Humans
  • Kaplan-Meier Estimate
  • Lung Neoplasms / diagnosis*
  • Lung Neoplasms / etiology
  • Lung Neoplasms / mortality*
  • Microarray Analysis / methods
  • Neoplasm Grading
  • Neoplasm Metastasis
  • Neoplasm Staging
  • Reproducibility of Results
  • Support Vector Machine
  • Workflow

Substances

  • Biomarkers, Tumor