Mitigating bias in deep learning for diagnosis of coronary artery disease from myocardial perfusion SPECT images

Eur J Nucl Med Mol Imaging. 2023 Jan;50(2):387-397. doi: 10.1007/s00259-022-05972-w. Epub 2022 Oct 4.

Abstract

Purpose: Artificial intelligence (AI) has high diagnostic accuracy for coronary artery disease (CAD) from myocardial perfusion imaging (MPI). However, when trained using high-risk populations (such as patients with correlating invasive testing), the disease probability can be overestimated due to selection bias. We evaluated different strategies for training AI models to improve the calibration (accurate estimate of disease probability), using external testing.

Methods: Deep learning was trained using 828 patients from 3 sites, with MPI and invasive angiography within 6 months. Perfusion was assessed using upright (U-TPD) and supine total perfusion deficit (S-TPD). AI training without data augmentation (model 1) was compared to training with augmentation (increased sampling) of patients without obstructive CAD (model 2), and patients without CAD and TPD < 2% (model 3). All models were tested in an external population of patients with invasive angiography within 6 months (n = 332) or low likelihood of CAD (n = 179).

Results: Model 3 achieved the best calibration (Brier score 0.104 vs 0.121, p < 0.01). Improvement in calibration was particularly evident in women (Brier score 0.084 vs 0.124, p < 0.01). In external testing (n = 511), the area under the receiver operating characteristic curve (AUC) was higher for model 3 (0.930), compared to U-TPD (AUC 0.897) and S-TPD (AUC 0.900, p < 0.01 for both).

Conclusion: Training AI models with augmentation of low-risk patients can improve calibration of AI models developed to identify patients with CAD, allowing more accurate assignment of disease probability. This is particularly important in lower-risk populations and in women, where overestimation of disease probability could significantly influence down-stream patient management.

Keywords: Calibration; Deep learning; Diagnostic accuracy; Model training; Sex-specific analysis.

Publication types

  • Comment

MeSH terms

  • Artificial Intelligence
  • Coronary Angiography
  • Coronary Artery Disease* / diagnostic imaging
  • Deep Learning*
  • Female
  • Humans
  • Myocardial Perfusion Imaging* / methods
  • Perfusion
  • Sensitivity and Specificity
  • Tomography, Emission-Computed, Single-Photon / methods