Screening for chronic conditions with reproductive factors using a machine learning based approach

Sci Rep. 2020 Feb 18;10(1):2848. doi: 10.1038/s41598-020-59825-3.

Abstract

A large proportion of cases with chronic conditions including diabetes or pre-diabetes, hypertension and dyslipidemia remain undiagnosed. To include reproductive factors (RF) might be able to improve current screening guidelines by providing extra effectiveness. The objective is to study the relationships between RFs and chronic conditions' biomarkers. A cross-sectional study was conducted. Demographics, RFs and metabolic biomarkers were collected. The relationship of the metabolic biomarkers were shown by correlation analysis. Principal component analysis (PCA) and autoencoder were compared by cross-validation. The better one was adopted to extract a single marker, the general chronic condition (GCC), to represent the body's chronic conditions. Multivariate linear regression was performed to explore the relationship between GCC and RFs. In total, 1,656 postmenopausal females were included. A multi-layer autoencoder outperformed PCA in the dimensionality reduction performance. The extracted variable by autoencoder, GCC, was verified to be representative of three chronic conditions (AUC for patoglycemia, hypertension and dyslipidemia were 0.844, 0.824 and 0.805 respectively). Linear regression showed that earlier age at menarche (OR = 0.9976) and shorter reproductive life span (OR = 0.9895) were associated with higher GCC. Autoencoder performed well in the dimensionality reduction of clinical metabolic biomarkers. Due to high accessibility and effectiveness, RFs have potential to be included in screening tools for general chronic conditions and could enhance current screening guidelines.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Aged
  • Biomarkers / blood
  • Chronic Disease / epidemiology
  • Diabetes Mellitus / blood
  • Diabetes Mellitus / diagnosis
  • Diabetes Mellitus / epidemiology*
  • Diabetes Mellitus / pathology
  • Dyslipidemias / diagnosis
  • Dyslipidemias / epidemiology*
  • Dyslipidemias / pathology
  • Female
  • Guidelines as Topic
  • Humans
  • Hypertension / diagnosis
  • Hypertension / epidemiology*
  • Hypertension / pathology
  • Machine Learning
  • Male
  • Mass Screening
  • Middle Aged
  • Multivariate Analysis
  • Prediabetic State / diagnosis
  • Prediabetic State / epidemiology*
  • Prediabetic State / pathology
  • Principal Component Analysis
  • Risk Factors

Substances

  • Biomarkers