Reiterative modeling of combined transcriptomic and proteomic features refines and improves the prediction of early recurrence in squamous cell carcinoma of head and neck

Comput Biol Med. 2022 Oct:149:105991. doi: 10.1016/j.compbiomed.2022.105991. Epub 2022 Aug 18.

Abstract

Background: Patients with squamous cell carcinoma of the head and neck (SCCHN) have a high risk of recurrence. We aimed to develop machine learning methods to identify transcriptomic and proteomic features that provide accurate classification models for predicting risk of early recurrence in SCCHN patients.

Methods: Clinical, genomic, transcriptomic and proteomic features distinguishing recurrence risk were examined in SCCHN patients from The Cancer Genome Atlas (TCGA). Recurrence within one year after treatment was classified as high-risk and no recurrence as low-risk.
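
The labeling rule in the Methods can be sketched as a small function. This is a minimal illustration only, not the authors' code: the function name, the argument names, the use of months as the time unit, and the inclusive 12-month cutoff are all assumptions. The abstract does not say how patients who recurred after one year were handled, so the sketch leaves them unlabeled.

```python
from typing import Optional


def risk_label(recurred: bool, months_to_recurrence: Optional[float]) -> Optional[str]:
    """Assign a recurrence-risk class following the rule stated in the Methods.

    Hypothetical helper: names, units, and the inclusive cutoff are assumptions.
    """
    if not recurred:
        # No recurrence is classified as low-risk.
        return "low-risk"
    if months_to_recurrence is not None and months_to_recurrence <= 12:
        # Recurrence within one year after treatment is classified as high-risk.
        return "high-risk"
    # Later recurrence or unknown timing: not defined in the abstract,
    # so no label is assigned here.
    return None
```

Under these assumptions, `risk_label(True, 7.5)` gives the high-risk class and `risk_label(False, None)` gives the low-risk class.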

Results: No significant differences in individual clinicopathological characteristics, mutation profiles or mRNA expression patterns were seen between the groups using conventional statistical analysis. Using the machine learning algorithm extreme gradient boosting (XGBoost), we found that ten proteins (RAD50, 4E-BP1, MYH11, MAP2K1, BECN1, NF2, RAB25, ERRFI1, KDR, SERPINE1) and five mRNAs (PLAUR, DKK1, AXIN2, ANG and VEGFA) made the greatest contribution to classification. These features were used to build improved models in XGBoost. The best discrimination performance was achieved by combining transcriptomic and proteomic data, with an accuracy of 0.939 and an area under the ROC curve (AUC) of 0.951.
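
For readers less familiar with the two metrics reported in the Results, the sketch below shows how accuracy and AUC can be computed from true labels and predicted scores. It is a plain-Python illustration rather than the study's pipeline: the function names, the 0.5 decision threshold, and the coding of high-risk as 1 and low-risk as 0 are assumptions. AUC is computed here as the fraction of (high-risk, low-risk) pairs in which the high-risk patient receives the higher score, with ties counted as one half.

```python
from typing import Sequence


def accuracy(y_true: Sequence[int], y_score: Sequence[float], threshold: float = 0.5) -> float:
    """Fraction of patients whose thresholded score matches the true label."""
    if not y_true:
        raise ValueError("y_true must not be empty")
    predictions = [1 if score >= threshold else 0 for score in y_score]
    correct = sum(int(pred == truth) for pred, truth in zip(predictions, y_true))
    return correct / len(y_true)


def roc_auc(y_true: Sequence[int], y_score: Sequence[float]) -> float:
    """Area under the ROC curve via pairwise comparison of scores."""
    positives = [s for t, s in zip(y_true, y_score) if t == 1]
    negatives = [s for t, s in zip(y_true, y_score) if t == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one positive and one negative label")
    wins = 0.0
    for pos_score in positives:
        for neg_score in negatives:
            if pos_score > neg_score:
                wins += 1.0   # correctly ranked pair
            elif pos_score == neg_score:
                wins += 0.5   # tie counts as half
    return wins / (len(positives) * len(negatives))
```

The pairwise form is quadratic in the number of patients, which is acceptable for a cohort-sized illustration; a rank-based implementation would be preferable for large datasets.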

Conclusions: This study highlights the use of machine learning to identify transcriptomic and proteomic factors that play important roles in predicting risk of recurrence in patients with SCCHN, and to refine such models through iterative cycles that enhance their accuracy, thereby aiding the introduction of personalized treatment regimens.

Keywords: Early recurrence; Machine learning; Multi-omics; SCCHN; XGBoost.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Carcinoma, Squamous Cell* / genetics
  • Head and Neck Neoplasms* / genetics
  • Humans
  • Proteomics
  • RNA, Messenger / genetics
  • Squamous Cell Carcinoma of Head and Neck / genetics
  • Transcriptome / genetics
  • rab GTP-Binding Proteins / genetics

Substances

  • RNA, Messenger
  • Rab25 protein, human
  • rab GTP-Binding Proteins