Lesion detection of chest X-Ray based on scalable attention residual CNN

Math Biosci Eng. 2023 Jan;20(2):1730-1749. doi: 10.3934/mbe.2023079. Epub 2022 Nov 4.

Abstract

Most of the research on disease recognition in chest X-rays is limited to segmentation and classification, but the problem of inaccurate recognition in edges and small parts makes doctors spend more time making judgments. In this paper, we propose a lesion detection method based on a scalable attention residual CNN (SAR-CNN), which uses target detection to identify and locate diseases in chest X-rays and greatly improves work efficiency. We designed a multi-convolution feature fusion block (MFFB), tree-structured aggregation module (TSAM), and scalable channel and spatial attention (SCSA), which can effectively alleviate the difficulties in chest X-ray recognition caused by single resolution, weak communication of features of different layers, and lack of attention fusion, respectively. These three modules are embeddable and can be easily combined with other networks. Through a large number of experiments on the largest public lung chest radiograph detection dataset, VinDr-CXR, the mean average precision (mAP) of the proposed method was improved from 12.83% to 15.75% in the case of the PASCAL VOC 2010 standard, with IoU > 0.4, which exceeds the existing mainstream deep learning model. In addition, the proposed model has a lower complexity and faster reasoning speed, which is conducive to the implementation of computer-aided systems and provides referential solutions for relevant communities.

Keywords: attention mechanism; chest X-ray; deep learning; disease recognition; object detection.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Lung
  • Neural Networks, Computer*
  • Radiography, Thoracic / methods
  • X-Rays