Class key feature extraction and fusion for 2D medical image segmentation

Med Phys. 2024 Feb;51(2):1263-1276. doi: 10.1002/mp.16636. Epub 2023 Aug 8.

Abstract

Background: Size variation, complex semantic environments, and high visual similarity between structures in medical images often prevent deep learning models from achieving good segmentation performance.

Purpose: To overcome these problems and improve segmentation performance and the generalizability of the model.

Methods: We propose the key class feature reconstruction module (KCRM), which ranks channel weights and selects the key features (KFs) that contribute most to the segmentation result for each class. KCRM also reconstructs all local features to establish dependencies between local features and the KFs. In addition, we propose the spatial gating module (SGM), which employs the KFs to generate two spatial maps that suppress irrelevant regions, strengthening the model's ability to locate semantic objects. Finally, we enable the model to adapt to size variation by diversifying its receptive fields.
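The abstract does not give implementation details, so the following minimal PyTorch sketch is only an illustration of the two mechanisms described above (channel-weight ranking with top-k key-feature selection, and KF-driven spatial gating); the module names KeyFeatureSelector and SpatialGate, the parameter k, and the specific weighting and gating forms are assumptions, not the authors' exact design.

# Illustrative sketch only; forms and hyperparameters are assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

class KeyFeatureSelector(nn.Module):
    """Rank channels by learned per-class weights and keep the top-k as key features (KFs)."""

    def __init__(self, channels: int, num_classes: int, k: int = 16):
        super().__init__()
        self.k = k
        self.num_classes = num_classes
        # One channel-weight predictor per class (assumption).
        self.weight_fc = nn.Linear(channels, channels * num_classes)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, h, w = x.shape
        # Global context -> per-class channel weights.
        ctx = F.adaptive_avg_pool2d(x, 1).flatten(1)              # (B, C)
        weights = torch.sigmoid(
            self.weight_fc(ctx).view(b, self.num_classes, c))     # (B, classes, C)
        # Rank channels per class and keep the indices of the top-k.
        topk = weights.topk(self.k, dim=-1).indices               # (B, classes, k)
        # Gather the selected key-feature channels: (B, classes*k, H, W).
        idx = topk.reshape(b, -1, 1, 1).expand(-1, -1, h, w)
        return torch.gather(x, 1, idx)

class SpatialGate(nn.Module):
    """Use key features to produce two spatial maps that suppress irrelevant regions."""

    def __init__(self, kf_channels: int):
        super().__init__()
        self.to_maps = nn.Conv2d(kf_channels, 2, kernel_size=1)

    def forward(self, feats: torch.Tensor, kfs: torch.Tensor) -> torch.Tensor:
        maps = torch.sigmoid(self.to_maps(kfs))   # two spatial maps, (B, 2, H, W)
        gate = maps[:, :1] * maps[:, 1:]          # combine the two maps (assumption)
        return feats * gate                       # gated features

if __name__ == "__main__":
    x = torch.randn(2, 64, 32, 32)
    selector = KeyFeatureSelector(channels=64, num_classes=4, k=8)
    kfs = selector(x)                             # (2, 4*8, 32, 32)
    gated = SpatialGate(kf_channels=kfs.shape[1])(x, kfs)
    print(kfs.shape, gated.shape)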

Results: We integrate these modules into the class key feature extraction and fusion network (CKFFNet) and validate its performance on three public medical datasets: CHAOS, UW-Madison, and ISIC2017. The experimental results show that our method achieves better segmentation results and generalizability than mainstream methods.

Conclusion: Quantitative and qualitative experiments show that the proposed modules improve segmentation results and enhance model generalizability, making the approach suitable for practical application and extension.

Keywords: feature extraction; fusion; medical images; ranking channels; semantic segmentation.