A scoping methodological review of simulation studies comparing statistical and machine learning approaches to risk prediction for time-to-event data

Hayley Smith; Michael Sweeting; Tim Morris; Michael J Crowther

doi:10.1186/s41512-022-00124-y

Diagn Progn Res. 2022 Jun 2;6(1):10. doi: 10.1186/s41512-022-00124-y.

Authors

Hayley Smith¹, Michael Sweeting²,³, Tim Morris⁴, Michael J Crowther⁵

Affiliations

¹ Department of Health Sciences, University of Leicester, Leicester, LE1 7RH, UK. hrs18@leicester.ac.uk.
² Department of Health Sciences, University of Leicester, Leicester, LE1 7RH, UK.
³ Statistical Innovation, Oncology Biometrics, Oncology R&D, AstraZeneca, Cambridge, UK.
⁴ MRC Clinical Trials Unit at UCL, 90 High Holborn, London, WC1V 6LJ, UK.
⁵ Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden.

Abstract

Background: There is substantial interest in the adaptation and application of so-called machine learning approaches to prognostic modelling of censored time-to-event data. These methods must be compared and evaluated against existing methods in a variety of scenarios to determine their predictive performance. A scoping review of how machine learning methods have been compared to traditional survival models is important to identify the comparisons that have been made and where such comparisons are lacking, biased towards one approach, or misleading.

Methods: We conducted a scoping review of research articles published between 1 January 2000 and 2 December 2020 using PubMed. Eligible articles were those that used simulation studies to compare statistical and machine learning methods for risk prediction with a time-to-event outcome in a medical/healthcare setting. We focused on the data-generating mechanisms (DGMs), the methods that were compared, the estimands of the simulation studies, and the performance measures used to evaluate the methods.
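For orientation, the components named above (a DGM, a method under evaluation, a performance measure, and a stated number of repetitions) can be sketched in a few lines of Python. This is a hypothetical illustration and is not taken from any of the reviewed articles: the Weibull proportional hazards DGM, the exponential censoring, all parameter values, and the function names `simulate_dataset`, `harrell_c` and `run_study` are assumptions chosen for brevity. The "method" scored here is simply the true linear predictor, whereas a real comparison would fit each competing model to training data.

```python
import math
import random
import statistics


def simulate_dataset(n, beta=0.7, lam=0.1, gamma=1.5, cens_rate=0.05, rng=random):
    """One dataset from an assumed DGM: Weibull proportional hazards event
    times (inverse-transform sampling) with independent exponential censoring."""
    data = []
    for _ in range(n):
        x = rng.gauss(0.0, 1.0)                 # single standard normal covariate
        u = 1.0 - rng.random()                  # uniform on (0, 1], avoids log(0)
        t_event = (-math.log(u) / (lam * math.exp(beta * x))) ** (1.0 / gamma)
        t_cens = rng.expovariate(cens_rate)
        data.append((x, min(t_event, t_cens), t_event <= t_cens))
    return data


def harrell_c(data, score):
    """Harrell's concordance index: among comparable pairs, the proportion in
    which the subject with the shorter observed event time has the higher score."""
    concordant = 0.0
    comparable = 0
    for x_i, t_i, event_i in data:
        if not event_i:
            continue                            # the earlier time must be an event
        for x_j, t_j, _ in data:
            if t_i < t_j:
                comparable += 1
                s_i, s_j = score(x_i), score(x_j)
                if s_i > s_j:
                    concordant += 1.0
                elif s_i == s_j:
                    concordant += 0.5
    return concordant / comparable


def run_study(n_reps=100, n=100, seed=2022):
    """Repeat the simulation and return the mean C-index and its Monte Carlo SE."""
    rng = random.Random(seed)
    estimates = [
        harrell_c(simulate_dataset(n, rng=rng), score=lambda x: 0.7 * x)
        for _ in range(n_reps)
    ]
    mean_c = statistics.fmean(estimates)
    mcse = statistics.stdev(estimates) / math.sqrt(n_reps)
    return mean_c, mcse


if __name__ == "__main__":
    mean_c, mcse = run_study()
    print(f"mean C-index = {mean_c:.3f}, Monte Carlo SE = {mcse:.4f}")
```

Reporting the number of repetitions, the seed, and the Monte Carlo standard error alongside the performance measure is what makes a sketch like this reproducible.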

Results: A total of ten articles were identified as eligible for the review. Six of the articles evaluated a method developed by the authors (four of these were machine learning methods), and the results almost always stated that the developed method performed as well as or better than the other methods compared. Comparisons were often biased towards the novel approach: the majority compared it only against a basic Cox proportional hazards model, and did so in scenarios where the Cox model would clearly not perform well. In many of the articles reviewed, key information was unclear, such as the number of simulation repetitions and how performance measures were calculated.

Conclusion: It is vital that method comparisons are unbiased and comprehensive, and this should be the goal even if realising it is difficult. Fully assessing how newly developed methods perform and how they compare to a variety of traditional statistical methods for prognostic modelling is imperative as these methods are already being applied in clinical contexts. Evaluations of the performance and usefulness of recently developed methods for risk prediction should be continued and reporting standards improved as these methods become increasingly popular.

Keywords: Clinical risk prediction; Machine learning; Prognostic modelling; Simulation studies; Survival analysis.

Publication types

Review

Grants and funding

MC_UU_00004/07 / Medical Research Council (MRC) / United Kingdom