Comparing the Clinical Viability of Automated Fundus Image Segmentation Methods

Gorana Gojić; Veljko B Petrović; Dinu Dragan; Dušan B Gajić; Dragiša Mišković; Vladislav Džinić; Zorka Grgić; Jelica Pantelić; Ana Oros

doi:10.3390/s22239101

Sensors (Basel). 2022 Nov 23;22(23):9101. doi: 10.3390/s22239101.

Affiliations

¹ The Institute for Artificial Intelligence Research and Development of Serbia, 21102 Novi Sad, Serbia.
² Faculty of Technical Sciences, University of Novi Sad, 21102 Novi Sad, Serbia.
³ Eye Clinic Džinić, 21107 Novi Sad, Serbia.
⁴ Institute of Eye Diseases, University Clinical Center of Serbia, 11000 Belgrade, Serbia.
⁵ Institute of Neonatology, 11000 Belgrade, Serbia.

Abstract

Recent methods for automatic blood vessel segmentation from fundus images are commonly implemented as convolutional neural networks. While these networks report high values for objective metrics, the clinical viability of the recovered segmentation masks remains unexplored. In this paper, we perform a pilot study to assess the clinical viability of automatically generated segmentation masks in the diagnosis of diseases affecting retinal vascularization. Five ophthalmologists with clinical experience participated in the study. The results show low classification accuracy, implying that the generated segmentation masks cannot be used as a standalone resource in general clinical practice. The results also hint that the experimental design itself may be clinically infeasible. In a follow-up experiment, we evaluate the clinical quality of the masks by having the ophthalmologists rank the generation methods. The ranking is established with high intra-observer consistency, indicating better subjective performance for a subset of the tested networks. The study also demonstrates that objective metrics are not correlated with subjective metrics in retinal segmentation tasks for the methods involved, suggesting that the objective metrics commonly used in scientific papers to measure a method's performance are not reliable criteria for choosing clinically robust solutions.

Keywords: clinical viability; convolutional neural networks; fundus image; objective metrics; segmentation; segmentation mask; subjective assessment; subjective metrics.
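
The comparison between objective and subjective metrics described in the abstract can be illustrated with a minimal sketch. It assumes the Dice coefficient as the objective metric and Spearman's rank correlation as the agreement measure; the abstract does not state which metrics or statistic the authors used, so both are illustrative choices. The function names (`dice`, `average_ranks`, `spearman`) and the toy masks are hypothetical and are not data from the study.

```python
from typing import List, Sequence


def dice(pred: Sequence[Sequence[int]], truth: Sequence[Sequence[int]]) -> float:
    """Dice coefficient between two binary masks (1 = vessel, 0 = background)."""
    tp = fp = fn = 0
    for row_p, row_t in zip(pred, truth):
        for p, t in zip(row_p, row_t):
            if p and t:
                tp += 1
            elif p and not t:
                fp += 1
            elif t and not p:
                fn += 1
    denom = 2 * tp + fp + fn
    # Two empty masks agree perfectly by convention.
    return 1.0 if denom == 0 else 2 * tp / denom


def average_ranks(values: Sequence[float]) -> List[float]:
    """Ascending ranks starting at 1; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman's rho, computed as the Pearson correlation of the ranks."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / (var_x * var_y) ** 0.5


if __name__ == "__main__":
    # Hypothetical toy masks, not taken from the study.
    truth = [[0, 1, 1], [0, 1, 0]]
    pred = [[0, 1, 0], [1, 1, 0]]
    print(f"Dice: {dice(pred, truth):.3f}")
```

With one objective score per method and one subjective score per method (for example, a mean rank assigned by the ophthalmologists), `spearman(objective_scores, subjective_scores)` returns a value in [-1, 1]. A value near zero would be consistent with the lack of correlation the abstract reports, though a proper analysis would also need a significance test and a larger sample than a toy example provides.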

MeSH terms

Algorithms*
Fundus Oculi
Image Processing, Computer-Assisted / methods
Neural Networks, Computer*
Pilot Projects
