Breath analysis based early gastric cancer classification from deep stacked sparse autoencoder neural network

Sci Rep. 2021 Feb 17;11(1):4014. doi: 10.1038/s41598-021-83184-2.

Abstract

Deep learning is an emerging tool, which is regularly used for disease diagnosis in the medical field. A new research direction has been developed for the detection of early-stage gastric cancer. The computer-aided diagnosis (CAD) systems reduce the mortality rate due to their effectiveness. In this study, we proposed a new method for feature extraction using a stacked sparse autoencoder to extract the discriminative features from the unlabeled data of breath samples. A Softmax classifier was then integrated to the proposed method of feature extraction, to classify gastric cancer from the breath samples. Precisely, we identified fifty peaks in each spectrum to distinguish the EGC, AGC, and healthy persons. This CAD system reduces the distance between the input and output by learning the features and preserve the structure of the input data set of breath samples. The features were extracted from the unlabeled data of the breath samples. After the completion of unsupervised training, autoencoders with Softmax classifier were cascaded to develop a deep stacked sparse autoencoder neural network. In last, fine-tuning of the developed neural network was carried out with labeled training data to make the model more reliable and repeatable. The proposed deep stacked sparse autoencoder neural network architecture exhibits excellent results, with an overall accuracy of 98.7% for advanced gastric cancer classification and 97.3% for early gastric cancer detection using breath analysis. Moreover, the developed model produces an excellent result for recall, precision, and f score value, making it suitable for clinical application.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Aged
  • Algorithms
  • Asian People
  • Biomarkers, Tumor / analysis
  • Breath Tests / methods*
  • China
  • Computational Biology / methods
  • Data Accuracy
  • Deep Learning
  • Diagnosis, Computer-Assisted / methods
  • Early Detection of Cancer / methods*
  • Female
  • Humans
  • Male
  • Middle Aged
  • Neural Networks, Computer
  • Stomach Neoplasms / classification*
  • Stomach Neoplasms / diagnosis
  • Stomach Neoplasms / metabolism

Substances

  • Biomarkers, Tumor