Predictive models of miscarriage on the basis of data from a preconception cohort study

Fertil Steril. 2024 Apr 10:S0015-0282(24)00235-8. doi: 10.1016/j.fertnstert.2024.04.007. Online ahead of print.

Abstract

Objective: To use self-reported preconception data to derive models that predict the risk of miscarriage.

Design: Prospective preconception cohort study.

Setting: Not applicable.

Patients: Study participants were female, aged 21-45 years, residents of the United States or Canada, and attempting spontaneous pregnancy at enrollment during 2013-2022. Participants were followed for up to 12 months of pregnancy attempts; those who conceived were followed through pregnancy and postpartum. We restricted analyses to participants who conceived during the study period.

Exposure: On baseline and follow-up questionnaires completed every 8 weeks until pregnancy, we collected self-reported data on sociodemographic factors, reproductive history, lifestyle, anthropometrics, diet, medical history, and male partner characteristics. We included 160 potential predictor variables in our models.

Main outcome measures: The primary outcome was a miscarriage, defined as pregnancy loss before 20 weeks of gestation. We followed participants from their first positive pregnancy test until miscarriage or a censoring event (induced abortion, ectopic pregnancy, loss of follow-up, or 20 weeks of gestation), whichever occurred first. We fit both survival and static models using Cox proportional hazards models, logistic regression, support vector machines, gradient-boosted trees, and random forest algorithms. We evaluated model performance using the concordance index (survival models) and the weighted F1 score (static models).

Results: Among the 8,720 participants who conceived, 20.4% reported miscarriage. In multivariable models, the strongest predictors of miscarriage were female age, history of miscarriage, and male partner age. The weighted F1 score ranged from 73%-89% for static models and the concordance index ranged from 53%-56% for survival models, indicating better discrimination for the static models compared with the survival models (i.e., the ability of the model to discriminate between individuals with and without miscarriage). No appreciable differences were observed across strata of miscarriage history or among models restricted to ≥8 weeks of gestation.

Conclusion: Our findings suggest that miscarriage is not easily predicted on the basis of preconception lifestyle characteristics and that advancing age and a history of miscarriage are the most important predictors of incident miscarriage.

Keywords: Miscarriage; machine learning; predictive modeling; pregnancy; spontaneous abortion.