A multi-criteria evaluation platform for segmentation algorithms

Annu Int Conf IEEE Eng Med Biol Soc. 2016 Aug:2016:6441-6444. doi: 10.1109/EMBC.2016.7592203.

Abstract

The purpose of this paper is to present a platform for evaluating segmentation algorithms that detect anatomical structures in medical images. Structure detection being subject to human interpretation, we first describe a method to define a ground truth model, i.e. a generated bronze standard, that will be the reference for subsequent analysis. This bronze standard will be characterized in order to retrieve its confidence level that will later be used to normalize the algorithm evaluation. We then describe how the developed platform helps in evaluating algorithm performances described using five evaluation criteria: accuracy, reliability, robustness, under/over segmentation sensitivity and outlier sensitivity. First, we explain how to extract those evaluation criteria using specific normalized metrics commonly found in the literature, then we present how to combine all the information in order to get a global evaluation of segmentation algorithms. Lastly, a radar-style graph analysis is presented for easy multi-criteria interpretation.

MeSH terms

  • Algorithms*
  • Diagnostic Imaging*
  • Image Processing, Computer-Assisted / methods*
  • Reproducibility of Results