Prediction of Adult Height by Machine Learning Technique

J Clin Endocrinol Metab. 2021 Jun 16;106(7):e2700-e2710. doi: 10.1210/clinem/dgab093.

Abstract

Context: Prediction of AH is frequently undertaken in the clinical setting. The commonly used methods are based on the assessment of skeletal maturation. Predictive algorithms generated by machine learning, which can already automatically drive cars and recognize spoken language, are the keys to unlocking data that can precisely inform the pediatrician for real-time decision making.

Objective: To use machine learning (ML) to predict adult height (AH) based on growth measurements until age 6 years.

Methods: Growth data from 1596 subjects (798 boys) aged 0-20 years from the longitudinal GrowUp 1974 Gothenburg cohort were utilized to train multiple ML regressors. Of these, 100 were used for model comparison, the rest was used for 5-fold cross-validation. The winning model, random forest (RF), was first validated on 684 additional subjects from the 1974 cohort. It was additionally validated using 1890 subjects from the GrowUp 1990 Gothenburg cohort and 145 subjects from the Edinburgh Longitudinal Growth Study cohort.

Results: RF with 51 regression trees produced the most accurate predictions. The best predicting features were sex and height at age 3.4-6.0 years. Observed and predicted AHs were 173.9 ± 8.9 cm and 173.9 ± 7.7 cm, respectively, with prediction average error of -0.4 ± 4.0 cm. Validation of prediction for 684 GrowUp 1974 children showed prediction accuracy r = 0.87 between predicted and observed AH (R2 = 0.75). When validated on the 1990 Gothenburg and Edinburgh cohorts (completely unseen by the learned RF model), the prediction accuracy was r = 0.88 in both cases (R2 = 0.77). AH in short children was overpredicted and AH in tall children was underpredicted. Prediction absolute error correlated negatively with AH (P < .0001).

Conclusion: We show successful, validated ML of AH using growth measurements before age 6 years. The most important features for prediction were sex, and height at age 3.4-6.0. Prediction errors result in over- or underestimates of AH for short and tall subjects, respectively. Prediction by ML can be generalized to other cohorts.

Keywords: Artificial intelligence; child growth; growth analyses; height prediction; random tree.

Publication types

  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Adult
  • Algorithms
  • Anthropometry / methods*
  • Body Height*
  • Child
  • Child, Preschool
  • Decision Support Techniques
  • Growth Charts
  • Humans
  • Infant
  • Infant, Newborn
  • Longitudinal Studies
  • Machine Learning*
  • Male
  • Pediatrics
  • Predictive Value of Tests
  • Regression Analysis