MVD-Net: Semantic Segmentation of Cataract Surgery Using Multi-View Learning

Mingyang Ou; Heng Li; Haofeng Liu; Xiaoxuan Wang; Chenlang Yi; Luoying Hao; Yan Hu; Jiang Liu

doi:10.1109/EMBC48229.2022.9871673

MVD-Net: Semantic Segmentation of Cataract Surgery Using Multi-View Learning

Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul:2022:5035-5038. doi: 10.1109/EMBC48229.2022.9871673.

Authors

Mingyang Ou, Heng Li, Haofeng Liu, Xiaoxuan Wang, Chenlang Yi, Luoying Hao, Yan Hu, Jiang Liu

PMID: 36086003
DOI: 10.1109/EMBC48229.2022.9871673

Abstract

Semantic segmentation of surgery scenarios is a fundamental task for computer-aided surgery systems. Precise segmentation of surgical instruments and anatomies contributes to capturing accurate spatial information for tracking. However, uneven reflection and class imbalance lead the segmentation in cataract surgery to a challenging task. To desirably conduct segmentation, a network with multi-view decoders (MVD-Net) is proposed to present a generalizable segmentation for cataract surgery. Two discrepant decoders are implemented to achieve multi-view learning with the backbone of U-Net. The experiment is carried out on the Cataract Dataset for Image Segmentation (CaDIS). The ablation study verifies the effectiveness of the proposed modules in MVD-Net, and superior performance is provided by MVD-Net in the comparison with the state-of-the-art methods. The source code will be publicly released.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Cataract*
Deep Learning*
Humans
Image Processing, Computer-Assisted / methods
Neural Networks, Computer
Semantics