Diagnostic Performance of Machine Learning-Derived OSA Prediction Tools in Large Clinical and Community-Based Samples

Steven J Holfinger; M Melanie Lyons; Brendan T Keenan; Diego R Mazzotti; Jesse Mindel; Greg Maislin; Peter A Cistulli; Kate Sutherland; Nigel McArdle; Bhajan Singh; Ning-Hung Chen; Thorarinn Gislason; Thomas Penzel; Fang Han; Qing Yun Li; Richard Schwab; Allan I Pack; Ulysses J Magalang

doi:10.1016/j.chest.2021.10.023

Diagnostic Performance of Machine Learning-Derived OSA Prediction Tools in Large Clinical and Community-Based Samples

Chest. 2022 Mar;161(3):807-817. doi: 10.1016/j.chest.2021.10.023. Epub 2021 Oct 27.

Authors

Steven J Holfinger¹, M Melanie Lyons², Brendan T Keenan³, Diego R Mazzotti⁴, Jesse Mindel², Greg Maislin³, Peter A Cistulli⁵, Kate Sutherland⁵, Nigel McArdle⁶, Bhajan Singh⁶, Ning-Hung Chen⁷, Thorarinn Gislason⁸, Thomas Penzel⁹, Fang Han¹⁰, Qing Yun Li¹¹, Richard Schwab³, Allan I Pack³, Ulysses J Magalang²

Affiliations

¹ Division of Pulmonary, Critical Care, and Sleep Medicine, The Ohio State University Wexner Medical Center, Columbus, OH. Electronic address: Steven.Holfinger@osumc.edu.
² Division of Pulmonary, Critical Care, and Sleep Medicine, The Ohio State University Wexner Medical Center, Columbus, OH.
³ Division of Sleep Medicine, Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA.
⁴ Division of Medical Informatics, Department of Internal Medicine, University of Kansas Medical Center, Kansas City, KS.
⁵ Charles Perkins Centre, Faculty of Medicine and Health, University of Sydney, Sydney, NSW, Australia; Department of Respiratory and Sleep Medicine, Royal North Shore Hospital Sydney, Sydney, NSW, Australia.
⁶ West Australian Sleep Disorders Research Institute, Sir Charles Gairdner Hospital, Nedlands, WA, Australia; School of Human Sciences, University of Western Australia, Crawley, WA, Australia.
⁷ Division of Pulmonary, Critical Care Medicine and Sleep Medicine, Chang Gung Memorial Hospital, Taoyuan City, Taiwan.
⁸ Department of Sleep Medicine, Landspitali University Hospital, Reykjavik, Iceland; Medical Faculty, University of Iceland, Reykjavik, Iceland.
⁹ Interdisciplinary Center of Sleep Medicine, Charité University Hospital, Berlin, Germany.
¹⁰ Department of Respiratory Medicine, Peking University, Beijing, China.
¹¹ Department of Respiratory and Critical Care Medicine, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China.

Abstract

Background: Prediction tools without patient-reported symptoms could facilitate widespread identification of OSA.

Research question: What is the diagnostic performance of OSA prediction tools derived from machine learning using readily available data without patient responses to questionnaires? Also, how do they compare with STOP-BANG, an OSA prediction tool, in clinical and community-based samples?

Study design and methods: Logistic regression and machine learning techniques, including artificial neural network (ANN), random forests (RF), and kernel support vector machine, were used to determine the ability of age, sex, BMI, and race to predict OSA status. A retrospective cohort of 17,448 subjects from sleep clinics within the international Sleep Apnea Global Interdisciplinary Consortium (SAGIC) were randomly split into training (n = 10,469) and validation (n = 6,979) sets. Model comparisons were performed by using the area under the receiver-operating curve (AUC). Trained models were compared with the STOP-BANG questionnaire in two prospective testing datasets: an independent clinic-based sample from SAGIC (n = 1,613) and a community-based sample from the Sleep Heart Health Study (n = 5,599).

Results: The AUCs (95% CI) of the machine learning models were significantly higher than logistic regression (0.61 [0.60-0.62]) in both the training and validation datasets (ANN, 0.68 [0.66-0.69]; RF, 0.68 [0.67-0.70]; and kernel support vector machine, 0.66 [0.65-0.67]). In the SAGIC testing sample, the ANN (0.70 [0.68-0.72]) and RF (0.70 [0.68-0.73]) models had AUCs similar to those of the STOP-BANG (0.71 [0.68-0.72]). In the Sleep Heart Health Study testing sample, the ANN (0.72 [0.71-0.74]) had AUCs similar to those of STOP-BANG (0.72 [0.70-0.73]).

Interpretation: OSA prediction tools using machine learning without patient-reported symptoms provide better diagnostic performance than logistic regression. In clinical and community-based samples, the symptomless ANN tool has diagnostic performance similar to that of a widely used prediction tool that includes patient symptoms. Machine learning-derived algorithms may have utility for widespread identification of OSA.

Keywords: OSA; artificial neural network; electronic medical record; kernel support vector machine; machine learning; prediction model; random forest.

Publication types

Research Support, Non-U.S. Gov't
Research Support, N.I.H., Extramural

MeSH terms

Humans
Machine Learning
Polysomnography
Prospective Studies
Retrospective Studies
Sleep Apnea, Obstructive* / diagnosis
Surveys and Questionnaires

Abstract

Publication types

MeSH terms

Grants and funding