Methods and open-source toolkit for analyzing and visualizing challenge results

Manuel Wiesenfarth; Annika Reinke; Bennett A Landman; Matthias Eisenmann; Laura Aguilera Saiz; M Jorge Cardoso; Lena Maier-Hein; Annette Kopp-Schneider

doi:10.1038/s41598-021-82017-6

Methods and open-source toolkit for analyzing and visualizing challenge results

Sci Rep. 2021 Jan 27;11(1):2369. doi: 10.1038/s41598-021-82017-6.

Authors

Manuel Wiesenfarth¹, Annika Reinke², Bennett A Landman³, Matthias Eisenmann², Laura Aguilera Saiz², M Jorge Cardoso⁴, Lena Maier-Hein^#⁵, Annette Kopp-Schneider^#⁶

Affiliations

¹ Division of Biostatistics, German Cancer Research Center (DKFZ), Im Neuenheimer Feld 581, Heidelberg, 69120, Germany. m.wiesenfarth@dkfz-heidelberg.de.
² Division of Computer Assisted Medical Interventions (CAMI), German Cancer Research Center (DKFZ), Im Neuenheimer Feld 223, 69120, Heidelberg, Germany.
³ Electrical Engineering, Vanderbilt University, Nashville, TN, 37235-1679, USA.
⁴ School of Biomedical Engineering and Imaging Sciences, King's College London, London, WC2R 2LS, UK.
⁵ Division of Computer Assisted Medical Interventions (CAMI), German Cancer Research Center (DKFZ), Im Neuenheimer Feld 223, 69120, Heidelberg, Germany. l.maier-hein@dkfz-heidelberg.de.
⁶ Division of Biostatistics, German Cancer Research Center (DKFZ), Im Neuenheimer Feld 581, Heidelberg, 69120, Germany.

^# Contributed equally.

Abstract

Grand challenges have become the de facto standard for benchmarking image analysis algorithms. While the number of these international competitions is steadily increasing, surprisingly little effort has been invested in ensuring high quality design, execution and reporting for these international competitions. Specifically, results analysis and visualization in the event of uncertainties have been given almost no attention in the literature. Given these shortcomings, the contribution of this paper is two-fold: (1) we present a set of methods to comprehensively analyze and visualize the results of single-task and multi-task challenges and apply them to a number of simulated and real-life challenges to demonstrate their specific strengths and weaknesses; (2) we release the open-source framework challengeR as part of this work to enable fast and wide adoption of the methodology proposed in this paper. Our approach offers an intuitive way to gain important insights into the relative and absolute performance of algorithms, which cannot be revealed by commonly applied visualization techniques. This is demonstrated by the experiments performed in the specific context of biomedical image analysis challenges. Our framework could thus become an important tool for analyzing and visualizing challenge results in the field of biomedical image analysis and beyond.

Publication types

Research Support, Non-U.S. Gov't

Grants and funding

P50 HD103537/HD/NICHD NIH HHS/United States