A Self-attention Graph Convolutional Network for Precision Multi-tumor Early Diagnostics with DNA Methylation Data

Interdiscip Sci. 2023 Sep;15(3):405-418. doi: 10.1007/s12539-023-00563-1. Epub 2023 May 29.

Abstract

DNA methylation-based precision tumor early diagnostics is emerging as state-of-the-art technology that could capture early cancer signs 3 ~ 5 years in advance, even for clinically homogenous groups. Presently, the sensitivity of early detection for many tumors is ~ 30%, which needs significant improvement. Nevertheless, based on the genome-wide DNA methylation data, one could comprehensively characterize tumors' entire molecular genetic landscape and their subtle differences. Therefore, novel high-performance methods must be modeled by considering unbiased information using excessively available DNA methylation data. To fill this gap, we have designed a computational model involving a self-attention graph convolutional network and multi-class classification support vector machine to identify the 11 most common cancers using DNA methylation data. The self-attention graph convolutional network automatically learns key methylation sites in a data-driven way. Then, multi-tumor early diagnostics is realized by training a multi-class classification support vector machine based on the selected methylation sites. We evaluated our model's performance through several data sets of experiments, and our results demonstrate the effectiveness of the selected key methylation sites, which are highly relevant for blood diagnosis. The pipeline of the self-attention graph convolutional network based computational framework.

Keywords: Graph convolutional network; Key methylation sites; Multi-tumor early diagnostics; Self-attention mechanism.

MeSH terms

  • DNA Methylation* / genetics
  • Humans
  • Neoplasms* / diagnosis
  • Neoplasms* / genetics
  • Protein Processing, Post-Translational
  • Support Vector Machine