Intensive care photoplethysmogram datasets and machine-learning for blood pressure estimation: Generalization not guarantied

Guillaume Weber-Boisvert; Benoit Gosselin; Frida Sandberg

doi:10.3389/fphys.2023.1126957

Intensive care photoplethysmogram datasets and machine-learning for blood pressure estimation: Generalization not guarantied

Front Physiol. 2023 Mar 2:14:1126957. doi: 10.3389/fphys.2023.1126957. eCollection 2023.

Authors

Guillaume Weber-Boisvert¹, Benoit Gosselin¹, Frida Sandberg²

Affiliations

¹ Department of Electrical and Computer Engineering, Université Laval, Quebec, QC, Canada.
² Department of Biomedical Engineering, Lund University, Lund, Sweden.

Abstract

The large MIMIC waveform dataset, sourced from intensive care units, has been used extensively for the development of Photoplethysmography (PPG) based blood pressure (BP) estimation algorithms. Yet, because the data comes from patients in severe conditions-often under the effect of drugs-it is regularly noted that the relationship between BP and PPG signal characteristics may be anomalous, a claim that we investigate here. A sample of 12,000 records from the MIMIC waveform dataset was stacked up against the 219 records of the PPG-BP dataset, an alternative public dataset obtained under controlled experimental conditions. The distribution of systolic and diastolic BP data and 31 PPG pulse morphological features was first compared between datasets. Then, the correlation between features and BP, as well as between the features themselves, was analysed. Finally, regression models were trained for each dataset and validated against the other. Statistical analysis showed significant $(p < 0.001)$ differences between the datasets in diastolic BP and in 20 out of 31 features when adjusting for heart rate differences. The eight features showing the highest rank correlation $((ρ) > 0.40)$ to systolic BP in PPG-BP all displayed muted correlation levels $((ρ) < 0.10)$ in MIMIC. Regression tests showed twice higher baseline predictive power with PPG-BP than with MIMIC. Cross-dataset regression displayed a practically complete loss of predictive power for all models. The differences between the MIMIC and PPG-BP dataset exposed in this study suggest that BP estimation models based on the MIMIC dataset have reduced predictive power on the general population.

Keywords: BP estimation; PPG datasets; PPG-BP; UCI; blood pressure estimation; intensive care datasets; mimic; photoplethysmography.

Grants and funding

This research project was supported by the SMAART Program, Sentinel North and by the Canada Research Chair in Smart Biomedical Microsystems. Frida Sandberg was supported by the Swedish Research Council (Grant VR 2019-04272), and the Crafoord Foundation (Grant 20200605).