Contrastive Cross-Site Learning With Redesigned Net for COVID-19 CT Classification

IEEE J Biomed Health Inform. 2020 Oct;24(10):2806-2813. doi: 10.1109/JBHI.2020.3023246. Epub 2020 Sep 10.

Abstract

The pandemic of coronavirus disease 2019 (COVID-19) has lead to a global public health crisis spreading hundreds of countries. With the continuous growth of new infections, developing automated tools for COVID-19 identification with CT image is highly desired to assist the clinical diagnosis and reduce the tedious workload of image interpretation. To enlarge the datasets for developing machine learning methods, it is essentially helpful to aggregate the cases from different medical systems for learning robust and generalizable models. This paper proposes a novel joint learning framework to perform accurate COVID-19 identification by effectively learning with heterogeneous datasets with distribution discrepancy. We build a powerful backbone by redesigning the recently proposed COVID-Net in aspects of network architecture and learning strategy to improve the prediction accuracy and learning efficiency. On top of our improved backbone, we further explicitly tackle the cross-site domain shift by conducting separate feature normalization in latent space. Moreover, we propose to use a contrastive training objective to enhance the domain invariance of semantic embeddings for boosting the classification performance on each dataset. We develop and evaluate our method with two public large-scale COVID-19 diagnosis datasets made up of CT images. Extensive experiments show that our approach consistently improves the performanceson both datasets, outperforming the original COVID-Net trained on each dataset by 12.16% and 14.23% in AUC respectively, also exceeding existing state-of-the-art multi-site learning methods.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Betacoronavirus*
  • COVID-19
  • COVID-19 Testing
  • Clinical Laboratory Techniques / statistics & numerical data*
  • Computational Biology
  • Computer Systems
  • Coronavirus Infections / classification
  • Coronavirus Infections / diagnosis*
  • Coronavirus Infections / diagnostic imaging*
  • Databases, Factual / statistics & numerical data
  • Deep Learning*
  • Humans
  • Machine Learning
  • Pandemics* / classification
  • Pneumonia, Viral / classification
  • Pneumonia, Viral / diagnosis*
  • Pneumonia, Viral / diagnostic imaging*
  • Radiographic Image Interpretation, Computer-Assisted / statistics & numerical data
  • SARS-CoV-2
  • Tomography, X-Ray Computed / statistics & numerical data*

Grants and funding

This work was supported by a CUHK start-up research grant and CUHK Shun Hing Institute of Advanced Engineering (project MMT-p5-20).