A 3D reconstruction based on an unsupervised domain adaptive for binocular endoscopy

Guo Zhang; Zhiwei Huang; Jinzhao Lin; Zhangyong Li; Enling Cao; Yu Pang; Weiwei Sun

doi:10.3389/fphys.2022.994343

Front Physiol. 2022 Sep 1;13:994343. doi: 10.3389/fphys.2022.994343. eCollection 2022.

Authors

Guo Zhang¹ ², Zhiwei Huang¹ ², Jinzhao Lin³, Zhangyong Li⁴, Enling Cao⁵, Yu Pang³, Weiwei Sun³

Affiliations

¹ School of Communication and Information Engineering, Chongqing University of Posts and Telecommunication, Chongqing, China.
² School of Medical Information and Engineering, Southwest Medical University, Luzhou, China.
³ School of Optoelectronic Engineering, Chongqing University of Posts and Telecommunication, Chongqing, China.
⁴ School of Bioinformatics, Chongqing University of Posts and Telecommunication, Chongqing, China.
⁵ School of Software Engineering, Chongqing University of Posts and Telecommunication, Chongqing, China.

Abstract

In minimally invasive surgery, endoscopic image quality plays a crucial role. To address the lack of real parallax in binocular endoscopic images, this article proposes an unsupervised adaptive neural network that combines adaptive smoke removal, depth estimation from binocular endoscopic images, and 3D display of high-quality endoscopic images. We simulate the smoke generated during surgery by artificially adding fog. U-Net training images fused by a Laplacian pyramid are introduced to improve the network's ability to extract intermediate features, and the Convolutional Block Attention Module is introduced to obtain the optimal parameters of each layer of the network. Using the disparity transformation relationship between left- and right-eye images, we combine the left-eye images with the disparity in HS-ResNet to obtain virtual right-eye images, which serve as labels for self-supervised training. The method extracts and fuses the parallax images at different scale levels of the decoder, making the generated parallax images more complete and smoother. Extensive experiments show that the scheme removes the smoke generated during the operation, effectively reconstructs the 3D image of the tissue structure seen by the binocular endoscope, and preserves the contour, edge, detail, and texture of the blood vessels in the medical image. Compared with existing similar schemes, various indicators are greatly improved, and the method has good clinical application prospects.
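
The disparity-based step described in the abstract, in which a left-eye image and a disparity map yield a virtual right-eye image, can be illustrated with a minimal sketch. This is not the authors' HS-ResNet implementation. It assumes a rectified grayscale stereo pair, a disparity map defined on the right-view pixel grid, and the convention that right pixel x corresponds to left pixel x + d. The function names `warp_left_to_right` and `l1_photometric_loss` are hypothetical, and the L1 comparison is only one possible self-supervision signal.

```python
def warp_left_to_right(left, disparity):
    """Synthesize a virtual right view from a left view and a disparity map.

    left:      2D list of grayscale values (rows of floats), rectified left image.
    disparity: 2D list of the same shape, horizontal disparity in pixels,
               defined on the right-view grid (assumed convention).
    """
    height, width = len(left), len(left[0])
    virtual_right = []
    for y in range(height):
        row = []
        for x in range(width):
            # Assumed convention: right pixel x matches left pixel x + d.
            src = x + disparity[y][x]
            # Clamp the sampling position to the image border.
            src = min(max(src, 0.0), width - 1.0)
            # Linear interpolation between the two nearest left-image columns.
            x0 = int(src)
            x1 = min(x0 + 1, width - 1)
            frac = src - x0
            row.append((1.0 - frac) * left[y][x0] + frac * left[y][x1])
        virtual_right.append(row)
    return virtual_right


def l1_photometric_loss(image_a, image_b):
    """Mean absolute difference between two images of equal shape."""
    total = 0.0
    count = 0
    for row_a, row_b in zip(image_a, image_b):
        for a, b in zip(row_a, row_b):
            total += abs(a - b)
            count += 1
    return total / count


if __name__ == "__main__":
    left = [[0.0, 1.0, 2.0, 3.0]]
    disparity = [[1.0, 1.0, 1.0, 1.0]]
    real_right = [[1.0, 2.0, 3.0, 4.0]]
    virtual_right = warp_left_to_right(left, disparity)
    print(virtual_right)
    print(l1_photometric_loss(virtual_right, real_right))
```

In a trained network the disparity map would be predicted rather than given, and the warp would need to be differentiable so that the comparison between the virtual and real right views can drive learning without ground-truth disparity.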

Keywords: adaptive; binocular endoscopic; deep learning; smoke; three-dimensional.