Exploring and validating the prognostic value of pathomics signatures and genomics in patients with cutaneous melanoma based on bioinformatics and deep learning

Med Phys. 2023 Nov;50(11):7049-7059. doi: 10.1002/mp.16748. Epub 2023 Sep 18.

Abstract

Background: Cutaneous melanoma (CM) is the most common malignant tumor of the skin. Our study aimed to investigate the prognostic value of pathomics signatures for CM by combining pathomics and genomics.

Purpose: The purpose of this study was to explore the potential application value of pathomics signatures.

Methods: Pathology full scans, clinical information, and genomics data for CM patients were downloaded from The Cancer Genome Atlas (TCGA) database. Exploratory data analysis (EDA) was used to visualize patient characteristics. Genes related to a poorer prognosis were screened through differential analysis. Survival analysis was performed to assess the prognostic value of gene and pathomics signatures. Artificial neural network (ANN) models predicted prognosis using signatures and genes. Correlation analysis was used to explore signature-gene links.

Results: The clinical traits for 468 CM samples and the genomic data and pathology images for 471 CM samples were obtained from the TCGA database. The EDA results combined with multiple machine learning (ML) models suggested that the top 5 clinical traits in terms of importance were age, biopsy site, T stage, N stage and overall disease stage, and the eight ML models had a precision lower than 0.56. A total of 60 differentially expressed genes were obtained by comparing sequencing data. A total of 413 available quantitative signatures of each pathomics image were obtained with CellProfile software. The precision of the binary classification model based on pathomics signatures was 0.99, with a loss value of 1.7119e-04. The precision of the binary classification model based on differentially expressed genes was 0.98, with a loss value of 0.1101. The precision of the binary classification model based on pathomics signatures and differentially expressed genes was 0.97, with a loss value of 0.2088. The survival analyses showed that the survival rate of the high-risk group based on gene expression and pathomics signatures was significantly lower than that of the low-risk group. A total of 222 pathomics signatures and 51 differentially expressed genes were analyzed for survival with p-values of less than 0.05. There was a certain correlation between some pathomics signatures and differential gene expression involving ANO2, LINC00158, NDNF, ADAMTS15, and ADGRB3, etc. CONCLUSION: This study evaluated the prognostic significance of pathomics signatures and differentially expressed genes in CM patients. Three ANN models were developed, and all achieved accuracy rates higher than 97%. Specifically, the pathomics signature-based ANN model maintained a remarkable accuracy of 99%. These findings highlight the CellProfile + ANN model as an excellent choice for prognostic prediction in CM patients. Furthermore, our correlation analysis experimentally demonstrated a preliminary link between disease quantification and qualitative changes. Among various features, including M stage and treatments received, special attention should be given to age, biopsy site, T stage, N stage, and overall disease stage in CM patients.

MeSH terms

  • Computational Biology
  • Deep Learning*
  • Genomics
  • Humans
  • Melanoma* / genetics
  • Melanoma, Cutaneous Malignant
  • Prognosis
  • Skin Neoplasms* / genetics