Medical Image Classification Using Light-weight CNN with Spiking Cortical Model Based Attention Module

IEEE J Biomed Health Inform. 2023 Feb 1:PP. doi: 10.1109/JBHI.2023.3241439. Online ahead of print.

Abstract

In the field of disease diagnosis where only a small dataset of medical images may be accessible, the light-weight convolutional neural network (CNN) has become popular because it can help to avoid the over-fitting problem and improve computational efficiency. However, the feature extraction capability of the light-weight CNN is inferior to that of the heavy-weight counterpart. Although the attention mechanism provides a feasible solution to this problem, the existing attention modules, such as the squeeze and excitation module and the convolutional block attention module, have insufficient non-linearity, thereby influencing the ability of the light-weight CNN to discover the key features. To address this issue, we have proposed a spiking cortical model based global and local (SCM-GL) attention module. The SCM-GL module analyzes the input feature maps in parallel and decomposes each map into several components according to the relation between pixels and their neighbors. The components are weighted summed to obtain a local mask. Besides, a global mask is produced by discovering the correlation between the distant pixels in the feature map. The final attention mask is generated by combining the local and global masks, and it is multiplied by the original map so that the important components can be highlighted to facilitate accurate disease diagnosis. To appreciate the performance of the SCM-GL module, this module and some mainstream attention modules have been embedded into the popular light-weight CNN models for comparison. Experiments on the classification of brain MR, chest X-ray, and osteosarcoma image datasets demonstrate that the SCM-GL module can significantly improve the classification performance of the evaluated light-weight CNN models by enhancing the ability of discovering the suspected lesions and it is generally superior to state-of-the-art attention modules in terms of accuracy, recall, specificity and F1 score.