Predicting muscle invasion in bladder cancer based on MRI: A comparison of radiomics, and single-task and multi-task deep learning

Comput Methods Programs Biomed. 2023 May:233:107466. doi: 10.1016/j.cmpb.2023.107466. Epub 2023 Mar 5.

Abstract

Background and objectives: Radiomics and deep learning are two popular technologies used to develop computer-aided detection and diagnosis schemes for analysing medical images. This study aimed to compare the effectiveness of radiomics, single-task deep learning (DL) and multi-task DL methods in predicting muscle-invasive bladder cancer (MIBC) status based on T2-weighted imaging (T2WI).

Methods: A total of 121 tumours (93 for training, from Centre 1; 28 for testing, from Centre 2) were included. MIBC was confirmed with pathological examination. A radiomics model, a single-task model, and a multi-task model based on T2WI were constructed in the training cohort with five-fold cross-validation, and validation was conducted in the external test cohort. Receiver operating characteristic (ROC) curve analysis was performed to evaluate the diagnostic performance of each model. DeLong's test and a permutation test were used to compare the performance of the models.

Results: The area under the ROC curve (AUC) values of the radiomics, single-task and multi-task models in the training cohort were: 0.920, 0.933 and 0.932, respectively; and were 0.844, 0.884 and 0.932, respectively, in the test cohort. The multi-task model achieved better performance in the test cohort than did the other models. No statistically significant differences in AUC values and Kappa coefficients were observed between pairwise models, in either the training or test cohorts. According to the Grad-CAM feature visualization results, the multi-task model focused more on the diseased tissue area in some samples of the test cohort compared with the single-task model.

Conclusions: The T2WI-based radiomics, single-task, and multi-task models all exhibited good diagnostic performance in preoperatively predicting MIBC, in which the multi-task model had the best diagnostic performance. Compared with the radiomics method, our multi-task DL method had the advantage of saving time and effort. Compared with the single-task DL method, our multi-task DL method had the advantage of being more lesion-focused and more reliable for clinical reference.

Keywords: Bladder cancer; Deep learning; Magnetic resonance imaging; Multi-task learning; Muscle invasion; Radiomics.

MeSH terms

  • Deep Learning*
  • Humans
  • Magnetic Resonance Imaging
  • Muscles / diagnostic imaging
  • ROC Curve
  • Retrospective Studies
  • Urinary Bladder Neoplasms* / diagnostic imaging