Attention mechanism and mixup data augmentation for classification of COVID-19 Computed Tomography images

J King Saud Univ Comput Inf Sci. 2022 Sep;34(8):6199-6207. doi: 10.1016/j.jksuci.2021.07.005. Epub 2021 Jul 15.

Abstract

The Coronavirus disease is quickly spreading all over the world and the emergency situation is still out of control. Latest achievements of deep learning algorithms suggest the use of deep Convolutional Neural Network to implement a computer-aided diagnostic system for automatic classification of COVID-19 CT images. In this paper, we propose to employ a feature-wise attention layer in order to enhance the discriminative features obtained by convolutional networks. Moreover, the original performance of the network has been improved using the mixup data augmentation technique. This work compares the proposed attention-based model against the stacked attention networks, and traditional versus mixup data augmentation approaches. We deduced that feature-wise attention extension, while outperforming the stacked attention variants, achieves remarkable improvements over the baseline convolutional neural networks. That is, ResNet50 architecture extended with a feature-wise attention layer obtained 95.57% accuracy score, which, to best of our knowledge, fixes the state-of-the-art in the challenging COVID-CT dataset.

Keywords: Attention; COVID-19; Classification; Computed Tomography (CT) images; Data augmentation; Mixup.