VSG-GAN: A high-fidelity image synthesis method with semantic manipulation in retinal fundus image

Junjie Liu; Shixin Xu; Ping He; Sirong Wu; Xi Luo; Yuhui Deng; Huaxiong Huang

doi:10.1016/j.bpj.2024.02.019

Biophys J. 2024 Feb 27:S0006-3495(24)00139-5. doi: 10.1016/j.bpj.2024.02.019. Online ahead of print.

Authors

Junjie Liu¹, Shixin Xu², Ping He³, Sirong Wu⁴, Xi Luo⁴, Yuhui Deng⁵, Huaxiong Huang⁶

Affiliations

¹ Guangdong Provincial Key Laboratory of Interdisciplinary Research and Application for Data Science, Zhuhai, China; BNU-HKBU United International College, Zhuhai, China; Faculty of Science, Hong Kong Baptist University, Hong Kong SAR, China; Trinity College Dublin, Dublin 2, Ireland.
² Data Science Research Center, Duke Kunshan University, Kunshan, Jiangsu, China.
³ Guangdong Provincial Key Laboratory of Interdisciplinary Research and Application for Data Science, Zhuhai, China; BNU-HKBU United International College, Zhuhai, China.
⁴ Guangdong Provincial Key Laboratory of Interdisciplinary Research and Application for Data Science, Zhuhai, China; BNU-HKBU United International College, Zhuhai, China; Faculty of Science, Hong Kong Baptist University, Hong Kong SAR, China.
⁵ Guangdong Provincial Key Laboratory of Interdisciplinary Research and Application for Data Science, Zhuhai, China; BNU-HKBU United International College, Zhuhai, China. Electronic address: ivandeng@uic.edu.cn.
⁶ Research Center for Mathematics, Beijing Normal University, Zhuhai, China; Guangdong Provincial Key Laboratory of Interdisciplinary Research and Application for Data Science, Zhuhai, China; Department of Mathematics and Statistics, York University, Toronto, ON, Canada. Electronic address: hhuang@uic.edu.cn.

PMID: 38414236
DOI: 10.1016/j.bpj.2024.02.019

Abstract

In recent years, advances in retinal image analysis, driven by machine learning and deep learning techniques, have improved disease detection and diagnosis through automated feature extraction. However, challenges persist, including limited data set diversity due to privacy concerns and imbalanced sample pairs, both of which hinder effective model training. To address these issues, we introduce the vessel and style guided generative adversarial network (VSG-GAN), an algorithm that builds on the generative adversarial network (GAN) framework. In VSG-GAN, a generator and a discriminator engage in an adversarial process to produce realistic retinal images. Our approach decouples retinal image generation into two distinct modules: the vascular skeleton and the background style. Leveraging style transformation and GAN inversion, our proposed hierarchical variational autoencoder module generates retinal images with diverse morphological traits. In addition, the spatially adaptive denormalization module ensures consistency between input and generated images. We evaluate our model on the MESSIDOR and RITE data sets using four metrics: structural similarity index measure, inception score, Fréchet inception distance, and kernel inception distance. VSG-GAN outperforms existing methods on all of these metrics, which underscores its effectiveness in addressing data set limitations and imbalances. By generating diverse and realistic retinal images, our algorithm provides a novel solution to challenges in retinal image analysis. Applying the VSG-GAN augmentation approach to downstream diabetic retinopathy classification tasks improved disease diagnosis accuracy, further advancing the utility of machine learning in this domain.
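
As a rough illustration of one of the metrics named in the abstract, the structural similarity index measure (SSIM) compares two images through their means, variances, and covariance. The sketch below is a minimal single-window version in plain Python. It is not the authors' evaluation code: the function name `global_ssim`, the flat-list image representation, and the use of one global window are assumptions made for brevity. Standard SSIM implementations average the same formula over local sliding windows.

```python
def global_ssim(x, y, data_range=255.0, k1=0.01, k2=0.03):
    """Single-window SSIM between two equal-length flat lists of pixel values.

    Illustrative simplification (an assumption of this sketch): statistics are
    computed once over the whole image instead of over local sliding windows.
    """
    if len(x) != len(y) or len(x) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(x)

    # Stabilising constants; k1 = 0.01 and k2 = 0.03 are the conventional defaults.
    c1 = (k1 * data_range) ** 2
    c2 = (k2 * data_range) ** 2

    # First- and second-order statistics of the two images.
    mu_x = sum(x) / n
    mu_y = sum(y) / n
    var_x = sum((a - mu_x) ** 2 for a in x) / n
    var_y = sum((b - mu_y) ** 2 for b in y) / n
    cov_xy = sum((a - mu_x) * (b - mu_y) for a, b in zip(x, y)) / n

    # Luminance term times combined contrast/structure term.
    numerator = (2 * mu_x * mu_y + c1) * (2 * cov_xy + c2)
    denominator = (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    return numerator / denominator


# Hypothetical 2x2 grayscale patches, flattened row by row.
real_patch = [0.0, 255.0, 0.0, 255.0]
inverted_patch = [255.0, 0.0, 255.0, 0.0]
same_score = global_ssim(real_patch, real_patch)
inverted_score = global_ssim(real_patch, inverted_patch)
```

An image compared with itself scores approximately 1, while the inverted patch scores below 0 because its covariance with the original is negative. For generated fundus images, a higher SSIM against a reference image indicates closer structural agreement.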