Predicting haemoglobin deferral using machine learning models: Can we use the same prediction model across countries?

Vox Sang. 2024 Apr 18. doi: 10.1111/vox.13643. Online ahead of print.

Abstract

Background and objectives: Personalized donation strategies based on haemoglobin (Hb) prediction models may reduce Hb deferrals and hence costs of donation, meanwhile improving commitment of donors. We previously found that prediction models perform better in validation data with a high Hb deferral rate. We therefore investigate how Hb deferral prediction models perform when exchanged with other blood establishments.

Materials and methods: Donation data from the past 5 years from random samples of 10,000 donors from Australia, Belgium, Finland, the Netherlands and South Africa were used to fit random forest models for Hb deferral prediction. Trained models were exchanged between blood establishments. Model performance was evaluated using the area under the precision-recall curve (AUPR). Variable importance was assessed using SHapley Additive exPlanations (SHAP) values.

Results: Across the validation datasets and exchanged models, the AUPR ranged from 0.05 to 0.43. Exchanged models performed similarly within validation datasets, irrespective of the origin of the training data. Apart from subtle differences, the importance of most predictor variables was similar in all trained models.

Conclusion: Our results suggest that Hb deferral prediction models trained in different blood establishments perform similarly within different validation datasets, regardless of the deferral rate of their training data. Models learn similar associations in different blood establishments.

Keywords: donor health; haemoglobin deferral; haemoglobin measurement; prediction.