Results Obtained from a Pivotal Validation Trial of a Microsatellite Analysis (MSA) Assay for Bladder Cancer Detection through a Statistical Approach Using a Four-Stage Pipeline of Modern Machine Learning Techniques

Int J Mol Sci. 2023 Dec 29;25(1):472. doi: 10.3390/ijms25010472.

Abstract

Several studies have shown that microsatellite changes can be profiled in urine for the detection of bladder cancer. The use of microsatellite analysis (MSA) for bladder cancer detection requires a comprehensive analysis of as many as 15 to 20 markers, based on the amplification and interpretations of many individual MSA markers, and it can be technically challenging. Here, to develop fast, more efficient, standardized, and less costly MSA for the detection of bladder cancer, we developed three multiplex-polymerase-chain-reaction-(PCR)-based MSA assays, all of which were analyzed via a genetic analyzer. First, we selected 16 MSA markers based on 9 selected publications. Based on samples from Johns Hopkins University (the JHU sample, the first set sample), we developed an MSA based on triplet, three-tube-based multiplex PCR (a Triplet MSA assay). The discovery, validation, and translation of biomarkers for the early detection of cancer are the primary focuses of the Early Detection Research Network (EDRN), an initiative of the National Cancer Institute (NCI). A prospective study sponsored by the EDRN was undertaken to determine the efficacy of a novel set of MSA markers for the early detection of bladder cancer. This work and data analysis were performed through a collaboration between academics and industry partners. In the current study, we undertook a re-analysis of the primary data from the Compass study to enhance the predictive power of the dataset in bladder cancer diagnosis. Using a four-stage pipeline of modern machine learning techniques, including outlier removal with a nonlinear model, correcting for majority/minority class imbalance, feature engineering, and the use of a model-derived variable importance measure to select predictors, we were able to increase the utility of the original dataset to predict the occurrence of bladder cancer. The results of this analysis showed an increase in accuracy (85%), sensitivity (82%), and specificity (83%) compared to the original analysis. The re-analysis of the EDRN study results using machine learning statistical analysis proved to achieve an appropriate level of accuracy, sensitivity, and specificity to support the use of the MSA for bladder cancer detection and monitoring. This assay can be a significant addition to the tools urologists use to both detect primary bladder cancers and monitor recurrent bladder cancer.

Keywords: Triplex MSA assay; bladder cancer; loss of heterozygosity; microsatellite instability.

MeSH terms

  • Humans
  • Machine Learning
  • Microsatellite Repeats / genetics
  • Prospective Studies
  • Urinary Bladder
  • Urinary Bladder Neoplasms* / diagnosis
  • Urinary Bladder Neoplasms* / genetics