Image Embeddings Extracted from CNNs Outperform Other Transfer Learning Approaches in Classification of Chest Radiographs

Noemi Gozzi; Edoardo Giacomello; Martina Sollini; Margarita Kirienko; Angela Ammirabile; Pierluca Lanzi; Daniele Loiacono; Arturo Chiti

doi:10.3390/diagnostics12092084

Image Embeddings Extracted from CNNs Outperform Other Transfer Learning Approaches in Classification of Chest Radiographs

Diagnostics (Basel). 2022 Aug 28;12(9):2084. doi: 10.3390/diagnostics12092084.

Authors

Noemi Gozzi^{1

2}, Edoardo Giacomello³, Martina Sollini^{1

4}, Margarita Kirienko⁵, Angela Ammirabile^{1

4}, Pierluca Lanzi³, Daniele Loiacono³, Arturo Chiti^{1

4}

Affiliations

¹ IRCCS Humanitas Research Hospital, Via Manzoni 56, Rozzano, 20089 Milan, Italy.
² Laboratory for Neuroengineering, Department of Health Sciences and Technology, Institute for Robotics and Intelligent Systems, ETH Zurich, 8092 Zurich, Switzerland.
³ Dipartimento di Elettronica, Informazione e Bioingegneria, Via Giuseppe Ponzio 34, 20133 Milan, Italy.
⁴ Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini 4, Pieve Emanuele, 20090 Milan, Italy.
⁵ Fondazione IRCCS Istituto Nazionale Tumori, Via G. Venezian 1, 20133 Milan, Italy.

Abstract

To identify the best transfer learning approach for the identification of the most frequent abnormalities on chest radiographs (CXRs), we used embeddings extracted from pretrained convolutional neural networks (CNNs). An explainable AI (XAI) model was applied to interpret black-box model predictions and assess its performance. Seven CNNs were trained on CheXpert. Three transfer learning approaches were thereafter applied to a local dataset. The classification results were ensembled using simple and entropy-weighted averaging. We applied Grad-CAM (an XAI model) to produce a saliency map. Grad-CAM maps were compared to manually extracted regions of interest, and the training time was recorded. The best transfer learning model was that which used image embeddings and random forest with simple averaging, with an average AUC of 0.856. Grad-CAM maps showed that the models focused on specific features of each CXR. CNNs pretrained on a large public dataset of medical images can be exploited as feature extractors for tasks of interest. The extracted image embeddings contain relevant information that can be used to train an additional classifier with satisfactory performance on an independent dataset, demonstrating it to be the optimal transfer learning strategy and overcoming the need for large private datasets, extensive computational resources, and long training times.

Keywords: X-rays; artificial intelligence; explainability; medical imaging; transfer learning.

Grants and funding

This research received no external funding.