Image-to-image translation of label-free molecular vibrational images for a histopathological review using the UNet+/seg-cGAN model

Yunjie He; Jiasong Li; Steven Shen; Kai Liu; Kelvin K Wong; Tiancheng He; Stephen T C Wong

doi:10.1364/BOE.445319

Image-to-image translation of label-free molecular vibrational images for a histopathological review using the UNet+/seg-cGAN model

Biomed Opt Express. 2022 Mar 8;13(4):1924-1938. doi: 10.1364/BOE.445319. eCollection 2022 Apr 1.

Authors

Yunjie He¹, Jiasong Li¹, Steven Shen², Kai Liu¹, Kelvin K Wong^{1

3}, Tiancheng He¹, Stephen T C Wong^{1

2

3}

Affiliations

¹ Translational Biophotonics Laboratory, Systems Medicine and Bioengineering Department, Houston Methodist Cancer Center, Houston, USA.
² Pathology and Genome Medicine Department, Houston Methodist Hospital, Weill Cornell Medicine, Houston, USA.
³ T.T. and W. F. Chao Center for BRAIN, Houston Methodist Academic Institute, USA.

Abstract

Translating images generated by label-free microscopy imaging, such as Coherent Anti-Stokes Raman Scattering (CARS), into more familiar clinical presentations of histopathological images will help the adoption of real-time, spectrally resolved label-free imaging in clinical diagnosis. Generative adversarial networks (GAN) have made great progress in image generation and translation, but have been criticized for lacking precision. In particular, GAN has often misinterpreted image information and identified incorrect content categories during image translation of microscopy scans. To alleviate this problem, we developed a new Pix2pix GAN model that simultaneously learns classifying contents in the images from a segmentation dataset during the image translation training. Our model integrates UNet+ with seg-cGAN, conditional generative adversarial networks with partial regularization of segmentation. Technical innovations of the UNet+/seg-cGAN model include: (1) replacing UNet with UNet+ as the Pix2pix cGAN's generator to enhance pattern extraction and richness of the gradient, and (2) applying the partial regularization strategy to train a part of the generator network as the segmentation sub-model on a separate segmentation dataset, thus enabling the model to identify correct content categories during image translation. The quality of histopathological-like images generated based on label-free CARS images has been improved significantly.