Development of Machine Learning Models for Prediction of Osteoporosis from Clinical Health Examination Data

Wen-Yu Ou Yang; Cheng-Chien Lai; Meng-Ting Tsou; Lee-Ching Hwang

doi:10.3390/ijerph18147635

Development of Machine Learning Models for Prediction of Osteoporosis from Clinical Health Examination Data

Int J Environ Res Public Health. 2021 Jul 18;18(14):7635. doi: 10.3390/ijerph18147635.

Authors

Wen-Yu Ou Yang¹, Cheng-Chien Lai², Meng-Ting Tsou^{3

4}, Lee-Ching Hwang^{3

5}

Affiliations

¹ Department of Neurology, Taipei Veterans General Hospital, Taipei City 11217, Taiwan.
² Department of Medicine, Taipei Veterans General Hospital, Taipei City 11217, Taiwan.
³ Department of Family Medicine, Mackay Memorial Hospital, Taipei City 10491, Taiwan.
⁴ Mackay Junior College of Medicine, Nursing and Management, Taipei City 11260, Taiwan.
⁵ Department of Medicine, Mackay Medical College, New Taipei City 252, Taiwan.

Abstract

Osteoporosis is treatable but often overlooked in clinical practice. We aimed to construct prediction models with machine learning algorithms to serve as screening tools for osteoporosis in adults over fifty years old. Additionally, we also compared the performance of newly developed models with traditional prediction models. Data were acquired from community-dwelling participants enrolled in health checkup programs at a medical center in Taiwan. A total of 3053 men and 2929 women were included. Models were constructed for men and women separately with artificial neural network (ANN), support vector machine (SVM), random forest (RF), k-nearest neighbor (KNN), and logistic regression (LoR) to predict the presence of osteoporosis. Area under receiver operating characteristic curve (AUROC) was used to compare the performance of the models. We achieved AUROC of 0.837, 0.840, 0.843, 0.821, 0.827 in men, and 0.781, 0.807, 0.811, 0.767, 0.772 in women, for ANN, SVM, RF, KNN, and LoR models, respectively. The ANN, SVM, RF, and LoR models in men, and the ANN, SVM, and RF models in women performed significantly better than the traditional Osteoporosis Self-Assessment Tool for Asians (OSTA) model. We have demonstrated that machine learning algorithms improve the performance of screening for osteoporosis. By incorporating the models in clinical practice, patients could potentially benefit from earlier diagnosis and treatment of osteoporosis.

Keywords: early detection; machine learning; osteoporosis; prediction model; screening tool.

MeSH terms

Adult
Female
Humans
Logistic Models
Machine Learning*
Male
Middle Aged
Neural Networks, Computer
Osteoporosis* / diagnosis
Osteoporosis* / epidemiology
Taiwan / epidemiology