Machine learning analysis with population data for the associations of preterm birth with temporomandibular disorder and gastrointestinal diseases

PLoS One. 2024 Jan 2;19(1):e0296329. doi: 10.1371/journal.pone.0296329. eCollection 2024.

Abstract

This study employs machine learning analysis with population data for the associations of preterm birth (PTB) with temporomandibular disorder (TMD) and gastrointestinal diseases. The source of the population-based retrospective cohort was Korea National Health Insurance claims for 489,893 primiparous women with delivery at the age of 25-40 in 2017. The dependent variable was PTB in 2017. Twenty-one predictors were included, i.e., demographic, socioeconomic, disease and medication information during 2002-2016. Random forest variable importance was derived for finding important predictors of PTB and evaluating its associations with the predictors including TMD and gastroesophageal reflux disease (GERD). Shapley Additive Explanation (SHAP) values were calculated to analyze the directions of these associations. The random forest with oversampling registered a much higher area under the receiver-operating-characteristic curve compared to logistic regression with oversampling, i.e., 79.3% vs. 53.1%. According to random forest variable importance values and rankings, PTB has strong associations with low socioeconomic status, GERD, age, infertility, irritable bowel syndrome, diabetes, TMD, salivary gland disease, hypertension, tricyclic antidepressant and benzodiazepine. In terms of max SHAP values, these associations were positive, e.g., low socioeconomic status (0.29), age (0.21), GERD (0.27) and TMD (0.23). The inclusion of low socioeconomic status, age, GERD or TMD into the random forest will increase the probability of PTB by 0.29, 0.21, 0.27 or 0.23. A cutting-edge approach of explainable artificial intelligence highlights the strong associations of preterm birth with temporomandibular disorder, gastrointestinal diseases and antidepressant medication. Close surveillance is needed for pregnant women regarding these multiple risks at the same time.

MeSH terms

  • Artificial Intelligence
  • Female
  • Gastroesophageal Reflux* / complications
  • Gastroesophageal Reflux* / drug therapy
  • Gastroesophageal Reflux* / epidemiology
  • Humans
  • Infant, Newborn
  • Machine Learning
  • Pregnancy
  • Premature Birth* / epidemiology
  • Retrospective Studies
  • Temporomandibular Joint Disorders* / epidemiology

Grants and funding

This work was supported by (1) the Korea University Medical Center grant (No. K1925051; Author K.H.A.; https://www.kumc.or.kr/en/index.do), (2) the Korea Health Industry Development Institute grant funded by the Ministry of Health & Welfare of South Korea (No. HI21C156001; Author K.H.A.; https://www.khidi.or.kr/eps), (3) the Korea Health Industry Development Institute grant (Korea Health Technology R&D Project) funded by the Ministry of Health & Welfare of South Korea (No. HI22C1463; Author K.H.A.; https://www.khidi.or.kr/eps), (4) the Korea Health Industry Development Institute grant (Korea Health Technology R&D Project) funded by the Ministry of Health & Welfare of South Korea (No. HI22C1302; Author K.-S.L.; https://www.khidi.or.kr/eps) and (5) the Korea Medical Device Development Fund grant funded by the Ministry of Science and ICT, the Ministry of Trade, Industry, and Energy, the Ministry of Health & Welfare, the Ministry of Food and Drug Safety of South Korea) (No. RS-2021-KD000009; Author E.S.K.; https://www.kmdf.org/). There was no additional external funding received for this study. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.