Machine-learning classifier models for predicting sarcopenia in the elderly based on physical factors

Geriatr Gerontol Int. 2024 Jun;24(6):595-602. doi: 10.1111/ggi.14895. Epub 2024 May 14.

Abstract

Aim: As the size of the elderly population gradually increases, musculoskeletal disorders, such as sarcopenia, are increasing. Diagnostic techniques such as X-rays, computed tomography, and magnetic resonance imaging are used to predict and diagnose sarcopenia, and methods using machine learning are gradually increasing. This study aimed to create a model that can predict sarcopenia using physical characteristics and activity-related variables without medical diagnostic equipment, such as imaging equipment, for the elderly aged 60 years or older.

Methods: A sarcopenia prediction model was constructed using public data obtained from the Korea National Health and Nutrition Examination Survey. Models were built using Logistic Regression, Support Vector Machine (SVM), XGBoost, LightGBM, RandomForest, and Multi-layer Perceptron Neural Network (MLP) algorithms, and the feature importance of the models trained with the algorithms, except for SVM and MLP, was analyzed.

Results: The sarcopenia prediction model built with the LightGBM algorithm achieved the highest test accuracy, of 0.848. In constructing the LightGBM model, physical characteristic variables such as body mass index, weight, and waist circumference showed high importance, and activity-related variables were also used in constructing the model.

Conclusions: The sarcopenia prediction model, which consisted of only physical characteristics and activity-related factors, showed excellent performance. This model has the potential to assist in the early detection of sarcopenia in the elderly, especially in communities with limited access to medical resources or facilities. Geriatr Gerontol Int 2024; 24: 595-602.

Keywords: machine learning; physical activity; physical characteristics; predictive model; sarcopenia.

MeSH terms

  • Aged
  • Aged, 80 and over
  • Algorithms
  • Body Mass Index
  • Female
  • Geriatric Assessment / methods
  • Humans
  • Logistic Models
  • Machine Learning*
  • Male
  • Middle Aged
  • Neural Networks, Computer
  • Nutrition Surveys
  • Republic of Korea / epidemiology
  • Sarcopenia* / diagnosis
  • Sarcopenia* / epidemiology
  • Support Vector Machine