W-Transformer: Accurate Cobb angles estimation by using a transformer-based hybrid structure

Med Phys. 2022 May;49(5):3246-3262. doi: 10.1002/mp.15561. Epub 2022 Mar 7.

Abstract

Background: Scoliosis is a type of spinal deformity, which is harmful to a person's health. In severe cases, it can trigger paralysis or death. The measurement of Cobb angle plays an essential role in assessing the severity of scoliosis.

Purpose: The aim of this paper is to propose an automatic system for landmark detection and Cobb angle estimation, which can effectively help clinicians diagnose and treat scoliosis.

Methods: A novel hybrid framework was proposed to measure Cobb angle precisely for clinical diagnosis, which was referred as W-Transformer due to its w-shaped architecture. First, a convolutional neural network of cascade residual blocks as our backbone was designed. Then a transformer was fused to learn the dependency information between spine and landmarks. In addition, a reinforcement branch was designed to improve the overlap of landmarks, and an improved prediction module was proposed to fine-tune the final coordinates of landmarks in Cobb angles estimation. Besides, the public Accurate Automated Spinal Curvature Estimation (AASCE) MICCAI 2019 challenge was served as data set. It supplies 609 manually labeled spine anterior-posterior (AP) X-ray images, each of which contains a total of 68 landmark labels and three Cobb Angles tags.

Results: From the perspective of the AASCE MICCAI 2019 challenge, we achieved a lower symmetric mean absolute percentage error (SMAPE) of 8.26% for all Cobb angles and the lowest averaged detection error of 50.89 in terms of landmark detection, compared with many state-of-the-art methods. We also provided the SMAPEs for the Cobb angles of the proximal-thoracic (PT), the main-thoracic (MT), and the thoracic-lumbar (TL) area, which are 5.27%, 14.59%, and 20.97% respectively, however, these data were not covered in most previous studies. Statistical analysis demonstrates that our model has obtained a high level of Pearson correlation coefficient of 0.9398 ( p < 0.001 $p&lt;0.001$ ), which shows excellent reliability of our model. Our model can yield 0.9489 ( p < 0.001 $p&lt;0.001$ ), 0.8817 ( p < 0.001 $p&lt;0.001$ ), and 0.9149 ( p < 0.001 $p&lt;0.001$ ) for PT, MT, and TL, respectively. The overall variability of Cobb angle measurement is less than 4 $^\circ$ , implying clinical value. And the mean absolute deviation (standard deviation) for three regions is 3.64 $^\circ$ (4.13 $^\circ$ ), 3.84 $^\circ$ (4.66 $^\circ$ ), and 3.80 $^\circ$ (4.19 $^\circ$ ). The results of Student paired t $t$ -test indicate that no statistically significant differences are observed between manual measurement and our automatic approach ( p $p$ -value is always > $&gt;$ 0.05). Regarding the diagnosis of scoliosis (Cobb angle > $&gt;$ 10 $^\circ$ ), the proposed method achieves a high sensitivity of 0.9577 and a specificity of 0.8475 for all spinal regions.

Conclusions: This study offers a brand-new automatic approach that is potentially of great benefit of the complex task of landmark detection and Cobb angle evaluation, which can provide helpful navigation information about the early diagnosis of scoliosis.

Keywords: Cobb angle; landmark; scoliosis.

MeSH terms

  • Humans
  • Neural Networks, Computer
  • Radiography
  • Reproducibility of Results
  • Scoliosis* / diagnostic imaging
  • Spine / diagnostic imaging