Predicting haemoglobin deferral using machine learning models: Can we use the same prediction model across countries?

Amber Meulenbeld; Jarkko Toivonen; Marieke Vinkenoog; Tinus Brits; Ronel Swanevelder; Dorien de Clippel; Veerle Compernolle; Surendra Karki; Marijke Welvaert; Katja van den Hurk; Joost van Rosmalen; Emmanuel Lesaffre; Mart Janssen; Mikko Arvas

doi:10.1111/vox.13643

Predicting haemoglobin deferral using machine learning models: Can we use the same prediction model across countries?

Vox Sang. 2024 Apr 18. doi: 10.1111/vox.13643. Online ahead of print.

Authors

Amber Meulenbeld^{1

2

3}, Jarkko Toivonen⁴, Marieke Vinkenoog¹, Tinus Brits⁵, Ronel Swanevelder⁵, Dorien de Clippel⁶, Veerle Compernolle^{6

7}, Surendra Karki⁸, Marijke Welvaert⁸, Katja van den Hurk^{1

2

3}, Joost van Rosmalen^{9

10

11}, Emmanuel Lesaffre¹², Mart Janssen¹, Mikko Arvas⁴

Affiliations

¹ Donor Medicine Research, Sanquin Research, Amsterdam, The Netherlands.
² Department of Public and Occupational Health, Amsterdam UMC, Amsterdam, The Netherlands.
³ Amsterdam Public Health Research Institute, Amsterdam UMC, Amsterdam, The Netherlands.
⁴ Research and Development, Finnish Red Cross Blood Service, Helsinki, Finland.
⁵ Business Intelligence, South African National Blood Service, Johannesburg, South Africa.
⁶ Dienst voor het Bloed, Belgian Red Cross Ugent, Ghent, Belgium.
⁷ Faculty of Medicine and Health Sciences, Ghent University, Ghent, Belgium.
⁸ Research and Development, Australian Red Cross Lifeblood, Sydney, Australia.
⁹ Department of Biostatistics, Erasmus MC, Rotterdam, The Netherlands.
¹⁰ Department of Epidemiology, Erasmus MC, Rotterdam, The Netherlands.
¹¹ Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands.
¹² L-Biostat, KU Leuven, Leuven, Belgium.

PMID: 38637123
DOI: 10.1111/vox.13643

Abstract

Background and objectives: Personalized donation strategies based on haemoglobin (Hb) prediction models may reduce Hb deferrals and hence costs of donation, meanwhile improving commitment of donors. We previously found that prediction models perform better in validation data with a high Hb deferral rate. We therefore investigate how Hb deferral prediction models perform when exchanged with other blood establishments.

Materials and methods: Donation data from the past 5 years from random samples of 10,000 donors from Australia, Belgium, Finland, the Netherlands and South Africa were used to fit random forest models for Hb deferral prediction. Trained models were exchanged between blood establishments. Model performance was evaluated using the area under the precision-recall curve (AUPR). Variable importance was assessed using SHapley Additive exPlanations (SHAP) values.

Results: Across the validation datasets and exchanged models, the AUPR ranged from 0.05 to 0.43. Exchanged models performed similarly within validation datasets, irrespective of the origin of the training data. Apart from subtle differences, the importance of most predictor variables was similar in all trained models.

Conclusion: Our results suggest that Hb deferral prediction models trained in different blood establishments perform similarly within different validation datasets, regardless of the deferral rate of their training data. Models learn similar associations in different blood establishments.

Keywords: donor health; haemoglobin deferral; haemoglobin measurement; prediction.

Abstract

Grants and funding