Machine Learning and Bioinformatics Framework Integration to Potential Familial DCM-Related Markers Discovery

Genes (Basel). 2021 Dec 2;12(12):1946. doi: 10.3390/genes12121946.

Abstract

Objectives: Dilated cardiomyopathy (DCM) is characterized by a specific transcriptome. Since the DCM molecular network is largely unknown, the aim was to identify specific disease-related molecular targets combining an original machine learning (ML) approach with protein-protein interaction network.

Methods: The transcriptomic profiles of human myocardial tissues were investigated integrating an original computational approach, based on the Custom Decision Tree algorithm, in a differential expression bioinformatic framework. Validation was performed by quantitative real-time PCR.

Results: Our preliminary study, using samples from transplanted tissues, allowed the discovery of specific DCM-related genes, including MYH6, NPPA, MT-RNR1 and NEAT1, already known to be involved in cardiomyopathies Interestingly, a combination of these expression profiles with clinical characteristics showed a significant association between NEAT1 and left ventricular end-diastolic diameter (LVEDD) (Rho = 0.73, p = 0.05), according to severity classification (NYHA-class III).

Conclusions: The use of the ML approach was useful to discover preliminary specific genes that could lead to a rapid selection of molecular targets correlated with DCM clinical parameters. For the first time, NEAT1 under-expression was significantly associated with LVEDD in the human heart.

Keywords: RNA-sequencing; dilated cardiomyopathy; gene expression analyses; heart failure; machine learning.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Biomarkers / metabolism*
  • Cardiomyopathy, Dilated / genetics
  • Cardiomyopathy, Dilated / metabolism
  • Cardiomyopathy, Dilated / pathology*
  • Case-Control Studies
  • Computational Biology / methods*
  • Female
  • Humans
  • Machine Learning / standards*
  • Male
  • Middle Aged
  • Protein Interaction Maps*
  • Sequence Analysis, RNA / methods
  • Severity of Illness Index
  • Transcriptome*

Substances

  • Biomarkers

Supplementary concepts

  • Familial dilated cardiomyopathy