COVID-19 Detection in CT/X-ray Imagery Using Vision Transformers

Mohamad Mahmoud Al Rahhal; Yakoub Bazi; Rami M Jomaa; Ahmad AlShibli; Naif Alajlan; Mohamed Lamine Mekhalfi; Farid Melgani

doi:10.3390/jpm12020310

COVID-19 Detection in CT/X-ray Imagery Using Vision Transformers

J Pers Med. 2022 Feb 18;12(2):310. doi: 10.3390/jpm12020310.

Authors

Mohamad Mahmoud Al Rahhal¹, Yakoub Bazi², Rami M Jomaa³, Ahmad AlShibli⁴, Naif Alajlan², Mohamed Lamine Mekhalfi⁴, Farid Melgani⁵

Affiliations

¹ Applied Computer Science Department, College of Applied Computer Science, King Saud University, Riyadh 11543, Saudi Arabia.
² Computer Engineering Department, College of Computer and Information Sciences, King Saud University, Riyadh 11543, Saudi Arabia.
³ Computer Science Department, College of Computer and Cyber Sciences, University of Prince Mugrin, Medina 42241, Saudi Arabia.
⁴ Computer Science Department, College of Computer and Information Sciences, King Saud University, Riyadh 11543, Saudi Arabia.
⁵ Department of Information Engineering and Computer Science, University of Trento, 38123 Trento, Italy.

Abstract

The steady spread of the 2019 Coronavirus disease has brought about human and economic losses, imposing a new lifestyle across the world. On this point, medical imaging tests such as computed tomography (CT) and X-ray have demonstrated a sound screening potential. Deep learning methodologies have evidenced superior image analysis capabilities with respect to prior handcrafted counterparts. In this paper, we propose a novel deep learning framework for Coronavirus detection using CT and X-ray images. In particular, a Vision Transformer architecture is adopted as a backbone in the proposed network, in which a Siamese encoder is utilized. The latter is composed of two branches: one for processing the original image and another for processing an augmented view of the original image. The input images are divided into patches and fed through the encoder. The proposed framework is evaluated on public CT and X-ray datasets. The proposed system confirms its superiority over state-of-the-art methods on CT and X-ray data in terms of accuracy, precision, recall, specificity, and F1 score. Furthermore, the proposed system also exhibits good robustness when a small portion of training data is allocated.

Keywords: COVID-19; X-ray images; computed tomography; deep learning; vision transformer.

Grants and funding

RSP-2021/69/King Saud University