DAWTran: dynamic adaptive windowing transformer network for pneumothorax segmentation with implicit feature alignment

Phys Med Biol. 2023 Aug 18;68(17). doi: 10.1088/1361-6560/aced79.

Abstract

Objective. This study aims to address the significant challenges posed by pneumothorax segmentation in computed tomography images due to the resemblance between pneumothorax regions and gas-containing structures such as the trachea and bronchus.Approach. We introduce a novel dynamic adaptive windowing transformer (DAWTran) network incorporating implicit feature alignment for precise pneumothorax segmentation. The DAWTran network consists of an encoder module, which employs a DAWTran, and a decoder module. We have proposed a unique dynamic adaptive windowing strategy that enables multi-head self-attention to effectively capture multi-scale information. The decoder module incorporates an implicit feature alignment function to minimize information deviation. Moreover, we utilize a hybrid loss function to address the imbalance between positive and negative samples.Main results. Our experimental results demonstrate that the DAWTran network significantly improves the segmentation performance. Specifically, it achieves a higher dice similarity coefficient (DSC) of 91.35% (a larger DSC value implies better performance), showing an increase of 2.21% compared to the TransUNet method. Meanwhile, it significantly reduces the Hausdorff distance (HD) to 8.06 mm (a smaller HD value implies better performance), reflecting a reduction of 29.92% in comparison to the TransUNet method. Incorporating the dynamic adaptive windowing (DAW) mechanism has proven to enhance DAWTran's performance, leading to a 4.53% increase in DSC and a 15.85% reduction in HD as compared to SwinUnet. The application of the implicit feature alignment (IFA) further improves the segmentation accuracy, increasing the DSC by an additional 0.11% and reducing the HD by another 10.01% compared to the model only employing DAW.Significance. These results highlight the potential of the DAWTran network for accurate pneumothorax segmentation in clinical applications, suggesting that it could be an invaluable tool in improving the precision and effectiveness of diagnosis and treatment in related healthcare scenarios. The improved segmentation performance with the inclusion of DAW and IFA validates the effectiveness of our proposed model and its components.

Keywords: dynamic adaptive windowing; implicit feature alignment; medical image analysis; pneumothorax segmentation; transformer.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bronchi
  • Humans
  • Image Processing, Computer-Assisted
  • Pneumothorax* / diagnostic imaging
  • Tomography, X-Ray Computed
  • Trachea