MVD-Net: Semantic Segmentation of Cataract Surgery Using Multi-View Learning

Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul:2022:5035-5038. doi: 10.1109/EMBC48229.2022.9871673.

Abstract

Semantic segmentation of surgery scenarios is a fundamental task for computer-aided surgery systems. Precise segmentation of surgical instruments and anatomies contributes to capturing accurate spatial information for tracking. However, uneven reflection and class imbalance lead the segmentation in cataract surgery to a challenging task. To desirably conduct segmentation, a network with multi-view decoders (MVD-Net) is proposed to present a generalizable segmentation for cataract surgery. Two discrepant decoders are implemented to achieve multi-view learning with the backbone of U-Net. The experiment is carried out on the Cataract Dataset for Image Segmentation (CaDIS). The ablation study verifies the effectiveness of the proposed modules in MVD-Net, and superior performance is provided by MVD-Net in the comparison with the state-of-the-art methods. The source code will be publicly released.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cataract*
  • Deep Learning*
  • Humans
  • Image Processing, Computer-Assisted / methods
  • Neural Networks, Computer
  • Semantics