Identification of endocrine-disrupting chemicals targeting key DCM-associated genes via bioinformatics and machine learning

Ecotoxicol Environ Saf. 2024 Apr 1:274:116168. doi: 10.1016/j.ecoenv.2024.116168. Epub 2024 Mar 9.

Abstract

Dilated cardiomyopathy (DCM) is a primary cause of heart failure (HF), with the incidence of HF increasing consistently in recent years. DCM pathogenesis involves a combination of inherited predisposition and environmental factors. Endocrine-disrupting chemicals (EDCs) are exogenous chemicals that interfere with endogenous hormone action and are capable of targeting various organs, including the heart. However, the impact of these disruptors on heart disease through their effects on genes remains underexplored. In this study, we aimed to explore key DCM-related genes using machine learning (ML) and the construction of a predictive model. Using the Gene Expression Omnibus (GEO) database, we screened differentially expressed genes (DEGs) and performed enrichment analyses of Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways related to DCM. Through ML techniques combining maximum relevance minimum redundancy (mRMR) and least absolute shrinkage and selection operator (LASSO) logistic regression, we identified key genes for predicting DCM (IL1RL1, SEZ6L, SFRP4, COL22A1, RNASE2, HB). Based on these key genes, 79 EDCs with the potential to affect DCM were identified, among which 4 (3,4-dichloroaniline, fenitrothion, pyrene, and isoproturon) have not been previously associated with DCM. These findings establish a novel relationship between the EDCs mediated by key genes and the development of DCM.

Keywords: Bioinformatics analysis; Dilated cardiomyopathy; EDCs-genes-disease Interactions; Endocrine-disrupting chemicals; Machine learning.

MeSH terms

  • Cardiomyopathy, Dilated*
  • Computational Biology
  • Endocrine Disruptors* / toxicity
  • Heart
  • Heart Diseases*
  • Humans
  • Machine Learning

Substances

  • Endocrine Disruptors