Unlocking the potential of microRNAs: machine learning identifies key biomarkers for myocardial infarction diagnosis

Cardiovasc Diabetol. 2023 Sep 11;22(1):247. doi: 10.1186/s12933-023-01957-7.

Abstract

Background: MicroRNAs (miRNAs) play a crucial role in regulating adaptive and maladaptive responses in cardiovascular diseases, making them attractive candidate biomarkers. However, their value as novel biomarkers for diagnosing cardiovascular diseases requires systematic evaluation.

Methods: In this study, we aimed to identify a key set of miRNA biomarkers using integrated bioinformatics and machine learning analysis. We combined three gene expression datasets from the Gene Expression Omnibus (GEO) database, which contain peripheral blood mononuclear cell (PBMC) samples from individuals with myocardial infarction (MI), individuals with stable coronary artery disease (CAD), and healthy individuals, and analyzed them to identify differentially expressed miRNAs as a first biomarker set. Additionally, we selected a second set of miRNAs based on their area under the receiver operating characteristic curve (AUC-ROC) for separating the CAD and MI samples. We designed a two-layer architecture for sample classification, in which the first layer separates healthy samples from unhealthy samples, and the second layer classifies the unhealthy samples as stable CAD or MI. We trained different machine learning models using both biomarker sets and evaluated their performance on a test set.
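
The two-layer architecture described in the Methods can be illustrated with a minimal sketch. This is not the authors' implementation: the threshold classifiers, the function names (`make_threshold_classifier`, `classify_two_layer`), and the expression values below are all hypothetical stand-ins, used only to show how a second classifier is applied solely to samples that the first layer flags as unhealthy.

```python
from typing import Callable, List, Sequence

# A classifier is any function mapping one sample's expression vector to True/False.
Classifier = Callable[[Sequence[float]], bool]


def make_threshold_classifier(index: int, cutoff: float) -> Classifier:
    """Hypothetical stand-in for a trained model: returns True when the
    expression value at position `index` exceeds `cutoff`."""
    return lambda sample: sample[index] > cutoff


def classify_two_layer(sample: Sequence[float],
                       is_unhealthy: Classifier,
                       is_mi: Classifier) -> str:
    """Layer 1 separates healthy from unhealthy samples.
    Layer 2 runs only on unhealthy samples and separates stable CAD from MI."""
    if not is_unhealthy(sample):
        return "healthy"
    return "MI" if is_mi(sample) else "CAD"


# Made-up expression values for two miRNAs (illustrative only, not study data).
layer1 = make_threshold_classifier(index=0, cutoff=1.0)
layer2 = make_threshold_classifier(index=1, cutoff=2.0)

samples: List[List[float]] = [[0.5, 3.0], [1.5, 1.0], [1.8, 2.5]]
labels = [classify_two_layer(s, layer1, layer2) for s in samples]
print(labels)
```

In practice each layer would be a fitted model (the abstract reports an SVM performing best for the CAD versus MI layer), but the cascade logic stays the same: any sample the first layer calls healthy is never passed to the second layer.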

Results: We identified hsa-miR-21-3p, hsa-miR-186-5p, and hsa-miR-32-3p as the differentially expressed miRNAs, and a set including hsa-miR-186-5p, hsa-miR-21-3p, hsa-miR-197-5p, hsa-miR-29a-5p, and hsa-miR-296-5p as the optimum set of miRNAs selected by their AUC-ROC. Both biomarker sets could distinguish healthy from unhealthy samples with complete accuracy. The best performance for the classification of CAD and MI was achieved with an SVM model trained using the biomarker set selected by AUC-ROC, with an AUC-ROC of 0.96 and an accuracy of 0.94 on the test data.
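
For readers unfamiliar with the two metrics reported above, the following sketch shows one standard way to compute them. AUC-ROC is computed here as the fraction of positive/negative sample pairs in which the positive sample receives the higher score (ties count half), and accuracy as the fraction of correct predictions. The `rank_features` helper is an assumption about how single-miRNA AUC-ROC values could be used for selection; the abstract does not specify the exact procedure. The names `miR-A` and `miR-B` and all values are made up for illustration.

```python
from typing import Dict, List, Sequence, Tuple


def auc_roc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC-ROC as the probability that a random positive sample scores
    higher than a random negative sample (ties count as half)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def accuracy(labels: Sequence[int], predictions: Sequence[int]) -> float:
    """Fraction of samples whose predicted class matches the true class."""
    return sum(y == p for y, p in zip(labels, predictions)) / len(labels)


def rank_features(labels: Sequence[int],
                  expression: Dict[str, List[float]]) -> List[Tuple[str, float]]:
    """Assumed selection step: score each miRNA by its single-feature AUC-ROC.
    Values below 0.5 are flipped, since lower expression in the positive
    class separates the groups just as well as higher expression."""
    ranked = []
    for name, values in expression.items():
        auc = auc_roc(labels, values)
        ranked.append((name, max(auc, 1.0 - auc)))
    return sorted(ranked, key=lambda item: item[1], reverse=True)


# Toy data: 1 = MI, 0 = stable CAD (illustrative only, not study data).
labels = [1, 1, 1, 0, 0, 0]
expression = {
    "miR-A": [2.1, 1.9, 2.4, 1.0, 1.2, 2.0],
    "miR-B": [0.3, 0.9, 0.5, 0.4, 0.8, 0.6],
}

ranking = rank_features(labels, expression)
print(ranking)
print(accuracy(labels, [1, 1, 0, 0, 0, 0]))
```

The pairwise formulation is quadratic in the number of samples, which is acceptable for small cohorts; for larger data a rank-based computation gives the same value more efficiently.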

Conclusions: Our study demonstrated that miRNA signatures derived from PBMCs could serve as valuable novel biomarkers for cardiovascular diseases.

Keywords: Bioinformatics; Biomarker; Machine learning; MicroRNA; Myocardial infarction.

MeSH terms

  • Biomarkers
  • Coronary Artery Disease* / diagnosis
  • Coronary Artery Disease* / genetics
  • Humans
  • Leukocytes, Mononuclear
  • Machine Learning
  • MicroRNAs* / genetics
  • Myocardial Infarction* / diagnosis
  • Myocardial Infarction* / genetics

Substances

  • MicroRNAs
  • Biomarkers
  • MIRN186 microRNA, human
  • MIRN296 microRNA, human