Identification of key biomarkers for STAD using filter feature selection approaches

Sci Rep. 2022 Nov 18;12(1):19854. doi: 10.1038/s41598-022-21760-w.

Abstract

Gastric cancer (GC) is the fifth most common cancer and the third leading cause of cancer death worldwide. The discovery of diagnostic biomarkers facilitates the early detection of GC. In this study, we used the limma method combined with joint mutual information (JMI), a machine learning algorithm, to identify a signature of 11 genes that performed well in distinguishing tumor from normal samples in a stomach adenocarcinoma (STAD) cohort. Two other GC datasets were used to validate the classification performance. Several of the candidate genes were correlated with GC tumor progression and survival. Overall, we highlight the application of feature selection approaches to the analysis of high-dimensional biological data, which will improve study accuracy and reduce the workload for researchers when identifying potential tumor biomarkers.
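To make the JMI filter step more concrete, the following is a minimal sketch of greedy JMI feature selection in Python. It is not the authors' implementation: the abstract does not give implementation details, so the function names (`mutual_information`, `jmi_select`, `discretize`), the tertile binning of expression values, and the synthetic data are all assumptions made for illustration. The sketch first picks the feature with the highest mutual information with the class label, then repeatedly adds the feature that maximizes the sum of joint mutual information I(X_j, X_s; Y) over the already selected features X_s. The limma differential-expression step is not shown.

```python
import numpy as np

def mutual_information(x, y):
    """Mutual information (in nats) between two discrete 1-D arrays."""
    x_vals, x_idx = np.unique(x, return_inverse=True)
    y_vals, y_idx = np.unique(y, return_inverse=True)
    joint = np.zeros((x_vals.size, y_vals.size))
    np.add.at(joint, (x_idx, y_idx), 1.0)      # contingency counts
    joint /= joint.sum()                        # joint probabilities
    px = joint.sum(axis=1, keepdims=True)
    py = joint.sum(axis=0, keepdims=True)
    nz = joint > 0                              # skip empty cells (0 * log 0 = 0)
    return float(np.sum(joint[nz] * np.log(joint[nz] / (px @ py)[nz])))

def discretize(expr, n_bins=3):
    """Equal-frequency binning of each column (assumed preprocessing)."""
    qs = np.linspace(0, 1, n_bins + 1)[1:-1]
    cuts = np.quantile(expr, qs, axis=0)        # shape: (n_bins - 1, n_features)
    cols = [np.searchsorted(cuts[:, j], expr[:, j]) for j in range(expr.shape[1])]
    return np.stack(cols, axis=1)

def jmi_select(X, y, k):
    """Greedy JMI selection on a discrete matrix X (n_samples, n_features)."""
    n_features = X.shape[1]
    n_levels = int(X.max()) + 1
    relevance = [mutual_information(X[:, j], y) for j in range(n_features)]
    selected = [int(np.argmax(relevance))]      # start with the most relevant feature
    while len(selected) < k:
        best_j, best_score = None, -np.inf
        for j in range(n_features):
            if j in selected:
                continue
            # Encode each pair (X_j, X_s) as one discrete variable, then
            # sum I(X_j, X_s; Y) over the already selected features.
            score = sum(
                mutual_information(X[:, j] * n_levels + X[:, s], y)
                for s in selected
            )
            if score > best_score:
                best_j, best_score = j, score
        selected.append(best_j)
    return selected

# Synthetic stand-in for an expression matrix: 200 samples, 20 "genes",
# of which columns 3 and 7 are shifted between the two classes.
rng = np.random.default_rng(0)
n = 200
y = np.repeat([0, 1], n // 2)                   # 0 = normal, 1 = tumor (hypothetical labels)
expr = rng.normal(size=(n, 20))
expr[:, 3] += 3.0 * y
expr[:, 7] -= 3.0 * y

X = discretize(expr)
selected = jmi_select(X, y, k=2)
print(selected)
```

On this synthetic matrix the two shifted columns should be the ones recovered. With real expression data, the candidate set would typically be narrowed first (in the study, by limma), since the pairwise scoring grows with the number of features times the number of selected features.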

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adenocarcinoma*
  • Algorithms
  • Biomarkers, Tumor / genetics
  • Computational Biology / methods
  • Humans
  • Stomach Neoplasms* / diagnosis
  • Stomach Neoplasms* / genetics

Substances

  • Biomarkers, Tumor