Uncovering Prognosis-Related Genes and Pathways by Multi-Omics Analysis in Lung Cancer

Biomolecules. 2020 Mar 30;10(4):524. doi: 10.3390/biom10040524.

Abstract

Lung cancer is one of the leading causes of death worldwide. Therefore, understanding the factors linked to patient survival is essential. Recently, multi-omics analysis has emerged, allowing for patient groups to be classified according to prognosis and at a more individual level, to support the use of precision medicine. Here, we combined RNA expression and miRNA expression with clinical information, to conduct a multi-omics analysis, using publicly available datasets (the cancer genome atlas (TCGA) focusing on lung adenocarcinoma (LUAD)). We were able to successfully subclass patients according to survival. The classifiers we developed, using inferred labels obtained from patient subtypes showed that a support vector machine (SVM), gave the best classification results, with an accuracy of 0.82 with the test dataset. Using these subtypes, we ranked genes based on RNA expression levels. The top 25 genes were investigated, to elucidate the mechanisms that underlie patient prognosis. Bioinformatics analyses showed that the expression levels of six out of 25 genes (ERO1B, DPY19L1, NCAM1, RET, MARCH1, and SLC7A8) were associated with LUAD patient survival (p < 0.05), and pathway analyses indicated that major cancer signaling was altered in the subtypes.

Keywords: lung cancer; multi-omics analysis; survival-associated genes.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adenocarcinoma of Lung / diagnosis*
  • Adenocarcinoma of Lung / genetics*
  • Aged
  • Female
  • Genomics*
  • Humans
  • Kaplan-Meier Estimate
  • Male
  • Middle Aged
  • Prognosis
  • Support Vector Machine
  • Unsupervised Machine Learning