External validation of a convolutional neural network algorithm for opportunistically detecting vertebral fractures in routine CT scans

Osteoporos Int. 2024 Jan;35(1):143-152. doi: 10.1007/s00198-023-06903-7. Epub 2023 Sep 7.

Abstract

The Convolutional Neural Network algorithm achieved a sensitivity of 94% and specificity of 93% in identifying scans with vertebral fractures (VFs). The external validation results suggest that the algorithm provides an opportunity to aid radiologists with the early identification of VFs in routine CT scans of abdomen and chest.

Purpose: To evaluate the performance of a previously trained Convolutional Neural Network (CNN) model to automatically detect vertebral fractures (VFs) in CT scans in an external validation cohort.

Methods: Two Chinese studies and clinical data were used to retrospectively select CT scans of the chest, abdomen and thoracolumbar spine in men and women aged ≥50 years. The CT scans were assessed using the semiquantitative (SQ) Genant classification for prevalent VFs in a process blinded to clinical information. The performance of the CNN model was evaluated against reference standard readings by the area under the receiver operating characteristics curve (AUROC), accuracy, Cohen's kappa, sensitivity, and specificity.

Results: A total of 4,810 subjects were included, with a median age of 62 years (IQR 56-67), of which 2,654 (55.2%) were females. The scans were acquired between January 2013 and January 2019 on 16 different CT scanners from three different manufacturers. 2,773 (57.7%) were abdominal CTs. A total of 628 scans (13.1%) had ≥1 VF (grade 2-3), representing 899 fractured vertebrae out of a total of 48,584 (1.9%) visualized vertebral bodies. The CNN's performance in identifying scans with ≥1 moderate or severe fractures achieved an AUROC of 0.94 (95% CI: 0.93-0.95), accuracy of 93% (95% CI: 93%-94%), kappa of 0.75 (95% CI: 0.72-0.77), a sensitivity of 94% (95% CI: 92-96%) and a specificity of 93% (95% CI: 93-94%).

Conclusion: The algorithm demonstrated excellent performance in the identification of vertebral fractures in a cohort of chest and abdominal CT scans of Chinese patients ≥50 years.

Keywords: Convolutional neural network; Osteoporosis; Predictive value of tests; Tomography, X-Ray computed; Vertebral body.

MeSH terms

  • Aged
  • Algorithms
  • Female
  • Humans
  • Male
  • Middle Aged
  • Neural Networks, Computer
  • Retrospective Studies
  • Spinal Fractures* / diagnostic imaging
  • Tomography, X-Ray Computed / methods