Identification and verification of diagnostic biomarkers in recurrent pregnancy loss via machine learning algorithm and WGCNA

Front Immunol. 2023 Aug 25:14:1241816. doi: 10.3389/fimmu.2023.1241816. eCollection 2023.

Abstract

Background: Recurrent pregnancy loss defined as the occurrence of two or more pregnancy losses before 20-24 weeks of gestation, is a prevalent and significant pathological condition that impacts human reproductive health. However, the underlying mechanism of RPL remains unclear. This study aimed to investigate the biomarkers and molecular mechanisms associated with RPL and explore novel treatment strategies for clinical applications.

Methods: The GEO database was utilized to retrieve the RPL gene expression profile GSE165004. This profile underwent differential expression analysis, WGCNA, functional enrichment, and subsequent analysis of RPL gene expression using LASSO regression, SVM-RFE, and RandomForest algorithms for hub gene screening. ANN model were constructed to assess the performance of hub genes in the dataset. The expression of hub genes in both the RPL and control group samples was validated using RT-qPCR. The immune cell infiltration level of RPL was assessed using CIBERSORT. Additionally, pan-cancer analysis was conducted using Sangerbox, and small-molecule drug screening was performed using CMap.

Results: A total of 352 DEGs were identified, including 198 up-regulated genes and 154 down-regulated genes. Enrichment analysis indicated that the DEGs were primarily associated with Fc gamma R-mediated phagocytosis, the Fc epsilon RI signaling pathway, and various metabolism-related pathways. The turquoise module, which showed the highest relevance to clinical symptoms based on WGCNA results, contained 104 DEGs. Three hub genes, WBP11, ACTR2, and NCSTN, were identified using machine learning algorithms. ROC curves demonstrated a strong diagnostic value when the three hub genes were combined. RT-qPCR confirmed the low expression of WBP11 and ACTR2 in RPL, whereas NCSTN exhibited high expression. The immune cell infiltration analysis results indicated an imbalance of macrophages in RPL. Meanwhile, these three hub genes exhibited aberrant expression in multiple malignancies and were associated with a poor prognosis. Furthermore, we identified several small-molecule drugs.

Conclusion: This study identifies and validates hub genes in RPL, which may lead to significant advancements in understanding the molecular mechanisms and treatment strategies for this condition.

Keywords: WGCNA; diagnostic biomarkers; immune cell infiltration; machine learning; recurrent pregnancy loss.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Abortion, Habitual* / diagnosis
  • Abortion, Habitual* / genetics
  • Algorithms
  • DNA-Binding Proteins
  • Female
  • Genes, Regulator*
  • Humans
  • Machine Learning
  • Pregnancy
  • RNA Splicing Factors
  • Transcription Factors

Substances

  • Transcription Factors
  • WBP11 protein, human
  • RNA Splicing Factors
  • DNA-Binding Proteins

Grants and funding

This work was supported by grants from the National Natural Science Foundation of China (Nos. 81960281and 82260306), Special Fund of Characteristic Innovation Team of the First Affiliated Hospital of Guangxi Medical University (NO YYZS202008), and the construction of clinical intervention protocols Guangxi key R & D program (Guike AB20159031).