Network-Based Analysis Reveals Novel Biomarkers in Peripheral Blood of Patients With Preeclampsia

Front Mol Biosci. 2022 Jun 16:9:757203. doi: 10.3389/fmolb.2022.757203. eCollection 2022.

Abstract

WGCNA is a potent systems biology approach that explains the connection of gene expression based on a microarray database, which facilitates the discovery of disease therapy targets or potential biomarkers. Preeclampsia is a kind of pregnancy-induced hypertension caused by complex factors. The disease's pathophysiology, however, remains unknown. The focus of this research is to utilize WGCNA to identify susceptible modules and genes in the peripheral blood of preeclampsia patients. Obtain the whole gene expression data of GSE48424 preeclampsia patients and normal pregnant women from NCBI's GEO database. WGCNA is used to construct a gene co-expression network by calculating correlation coefficients between modules and phenotypic traits, screening important modules, and filtering central genes. To identify hub genes, we performed functional enrichment analysis, pathway analysis, and protein-protein interaction (PPI) network construction on key genes in critical modules. Then, the genetic data file GSE149437 and clinical peripheral blood samples were used as a validation cohort to determine the diagnostic value of these key genes. Nine gene co-expression modules were constructed through WGCNA analysis. Among them, the blue module is significantly related to preeclampsia and is related to its clinical severity. Thirty genes have been discovered by using the intersection of the genes in the blue module and the DEGs genes as the hub genes. It was found that HDC, MS4A2, and SLC18A2 scored higher in the PPI network and were identified as hub genes. These three genes were also differentially expressed in peripheral blood validation samples. Based on the above three genes, we established the prediction model of peripheral blood markers of preeclampsia and drew the nomogram and calibration curve. The ROC curves were used in the training cohort GSE48424 and the validation cohort GSE149437 to verify the predictive value of the above model. Finally, it was confirmed in the collected clinical peripheral blood samples that MS4A2 was differentially expressed in the peripheral blood of early-onset and late-onset preeclampsia, which is of great significance. This study provides a new biomarker and prediction model for preeclampsia.

Keywords: GEO; MS4A2; WGCNA; nomogram; preeclampsia.