Approximating Intermediate Feature Maps of Self-Supervised Convolution Neural Network to Learn Hard Positive Representations in Chest Radiography

J Imaging Inform Med. 2024 Feb 21. doi: 10.1007/s10278-024-01032-x. Online ahead of print.

Abstract

Recent advances in contrastive learning have significantly improved the performance of deep learning models. In contrastive learning of medical images, handling positive representations can be difficult: strong augmentation techniques may disrupt training because standardized CXRs differ only subtly from one another, so heavily augmented positive pairs can end up differing from their source images more than distinct CXRs do, and additional effort is therefore required. In this study, we propose the intermediate feature approximation (IFA) loss, which improves the performance of contrastive convolutional neural networks by focusing more on positive representations of CXRs without requiring additional augmentations. The IFA loss encourages the feature maps of a query image and its positive pair to resemble each other by maximizing the cosine similarity between the intermediate feature outputs of the original data and the positive pairs. We therefore combine the InfoNCE loss, a commonly used loss that addresses negative representations, with the IFA loss, which addresses positive representations, to improve the contrastive network. We evaluated the network on various downstream tasks, including classification, object detection, and a generative adversarial network (GAN) inversion task. The downstream results demonstrate that the IFA loss improves performance by effectively mitigating data imbalance and data scarcity; furthermore, the trained encoder can serve as a perceptual loss for GAN inversion. In addition, we have made our model publicly available to facilitate access and encourage further research and collaboration in the field.
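
The abstract describes combining InfoNCE (for negative representations) with an IFA term that maximizes the cosine similarity between the intermediate feature maps of a query image and its positive pair. The sketch below illustrates one plausible way to express that combination in PyTorch; it is not the authors' released implementation, and the function names, the flattening of feature maps, the temperature, and the weighting factor lambda_ifa are illustrative assumptions.

```python
# Minimal sketch, assuming feats_q/feats_k are lists of intermediate feature
# maps (B, C, H, W) and z_q/z_k are final projection embeddings (B, D).
import torch
import torch.nn.functional as F

def ifa_loss(feats_q, feats_k):
    """Intermediate feature approximation: pull the intermediate feature maps
    of a query image and its positive pair together via cosine similarity."""
    loss = 0.0
    for q, k in zip(feats_q, feats_k):
        q = F.normalize(q.flatten(1), dim=1)   # (B, C*H*W), unit-normalized
        k = F.normalize(k.flatten(1), dim=1)
        loss = loss + (1.0 - (q * k).sum(dim=1)).mean()  # 1 - cosine similarity
    return loss / len(feats_q)

def info_nce_loss(z_q, z_k, temperature=0.07):
    """Standard InfoNCE with in-batch negatives; positives lie on the diagonal."""
    z_q = F.normalize(z_q, dim=1)
    z_k = F.normalize(z_k, dim=1)
    logits = z_q @ z_k.t() / temperature                  # (B, B) similarities
    labels = torch.arange(z_q.size(0), device=z_q.device)  # positive indices
    return F.cross_entropy(logits, labels)

def total_loss(z_q, z_k, feats_q, feats_k, lambda_ifa=1.0):
    """Combined objective: InfoNCE for negatives, IFA for positives."""
    return info_nce_loss(z_q, z_k) + lambda_ifa * ifa_loss(feats_q, feats_k)
```

In this sketch, feats_q and feats_k would be collected with forward hooks on selected encoder stages; how many stages are used and how lambda_ifa is set are design choices not specified in the abstract.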

Keywords: Chest radiograph (CXR); Contrastive learning; Hard negative representation; Hard positive representation; Self-supervised learning.