Evaluating an automated machine learning model that predicts visual acuity outcomes in patients with neovascular age-related macular degeneration

Abdallah Abbas; Ciara O'Byrne; Dun Jack Fu; Gabriella Moraes; Konstantinos Balaskas; Robbert Struyven; Sara Beqiri; Siegfried K Wagner; Edward Korot; Pearse A Keane

doi:10.1007/s00417-021-05544-y

Evaluating an automated machine learning model that predicts visual acuity outcomes in patients with neovascular age-related macular degeneration

Graefes Arch Clin Exp Ophthalmol. 2022 Aug;260(8):2461-2473. doi: 10.1007/s00417-021-05544-y. Epub 2022 Feb 5.

Authors

Affiliations

¹ University College London Medical School, London, UK. Abdallah.abbas.18@ucl.ac.uk.
² Moorfields Eye Hospital, London, UK.
³ School of Medicine, Trinity College, Dublin, Ireland.
⁴ University College London Medical School, London, UK.
⁵ NIHR Biomedical Research Centre at Moorfields Eye Hospital, UCL Institute of Ophthalmology, London, UK.
⁶ Stanford University Byers Eye Institute, Palo Alto, CA, USA.

Abstract

Purpose: Neovascular age-related macular degeneration (nAMD) is a major global cause of blindness. Whilst anti-vascular endothelial growth factor (anti-VEGF) treatment is effective, response varies considerably between individuals. Thus, patients face substantial uncertainty regarding their future ability to perform daily tasks. In this study, we evaluate the performance of an automated machine learning (AutoML) model which predicts visual acuity (VA) outcomes in patients receiving treatment for nAMD, in comparison to a manually coded model built using the same dataset. Furthermore, we evaluate model performance across ethnic groups and analyse how the models reach their predictions.

Methods: Binary classification models were trained to predict whether patients' VA would be 'Above' or 'Below' a score of 70 one year after initiating treatment, measured using the Early Treatment Diabetic Retinopathy Study (ETDRS) chart. The AutoML model was built using the Google Cloud Platform, whilst the bespoke model was trained using an XGBoost framework. Models were compared and analysed using the What-if Tool (WIT), a novel model-agnostic interpretability tool.

Results: Our study included 1631 eyes from patients attending Moorfields Eye Hospital. The AutoML model (area under the curve [AUC], 0.849) achieved a highly similar performance to the XGBoost model (AUC, 0.847). Using the WIT, we found that the models over-predicted negative outcomes in Asian patients and performed worse in those with an ethnic category of Other. Baseline VA, age and ethnicity were the most important determinants of model predictions. Partial dependence plot analysis revealed a sigmoidal relationship between baseline VA and the probability of an outcome of 'Above'.

Conclusion: We have described and validated an AutoML-WIT pipeline which enables clinicians with minimal coding skills to match the performance of a state-of-the-art algorithm and obtain explainable predictions.

Keywords: Anti-VEGF; Artificial intelligence; Automated machine learning; Model interpretability; Neovascular age-related macular degeneration; OCT.

MeSH terms

Angiogenesis Inhibitors / therapeutic use
Humans
Intravitreal Injections
Machine Learning
Macular Degeneration* / drug therapy
Ranibizumab / therapeutic use
Retrospective Studies
Treatment Outcome
Vascular Endothelial Growth Factor A
Visual Acuity
Wet Macular Degeneration* / diagnosis
Wet Macular Degeneration* / drug therapy

Substances

Angiogenesis Inhibitors
Vascular Endothelial Growth Factor A
Ranibizumab

Abstract

MeSH terms

Substances

Grants and funding