Detection of Association Features Based on Gene Eigenvalues and MRI Imaging Using Genetic Weighted Random Forest

Genes (Basel). 2022 Dec 12;13(12):2344. doi: 10.3390/genes13122344.

Abstract

In the studies of Alzheimer's disease (AD), jointly analyzing imaging data and genetic data provides an effective method to explore the potential biomarkers of AD. AD can be separated into healthy controls (HC), early mild cognitive impairment (EMCI), late mild cognitive impairment (LMCI) and AD. In the meantime, identifying the important biomarkers of AD progression, and analyzing these biomarkers in AD provide valuable insights into understanding the mechanism of AD. In this paper, we present a novel data fusion method and a genetic weighted random forest method to mine important features. Specifically, we amplify the difference among AD, LMCI, EMCI and HC by introducing eigenvalues calculated from the gene p-value matrix for feature fusion. Furthermore, we construct the genetic weighted random forest using the resulting fused features. Genetic evolution is used to increase the diversity among decision trees and the decision trees generated are weighted by weights. After training, the genetic weighted random forest is analyzed further to detect the significant fused features. The validation experiments highlight the performance and generalization of our proposed model. We analyze the biological significance of the results and identify some significant genes (CSMD1, CDH13, PTPRD, MACROD2 and WWOX). Furthermore, the calcium signaling pathway, arrhythmogenic right ventricular cardiomyopathy and the glutamatergic synapse pathway were identified. The investigational findings demonstrate that our proposed model presents an accurate and efficient approach to identifying significant biomarkers in AD.

Keywords: Alzheimer’s disease; eigenvalues; fusion; genetic weighted random forest; significant biomarkers.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alzheimer Disease* / diagnostic imaging
  • Alzheimer Disease* / genetics
  • Biomarkers
  • Brain
  • Cognitive Dysfunction* / diagnostic imaging
  • Cognitive Dysfunction* / genetics
  • Humans
  • Magnetic Resonance Imaging / methods
  • Random Forest

Substances

  • Biomarkers

Grants and funding

This study was funded by the National Natural Science Foundation of China (61901063, 61875022), the MOE (Ministry of Education in China) Project of Humanities and Social Sciences (19YJCZH120), and the Science and Technology Plan Project of Changzhou (CE20205042, CJ20220151). This work was also sponsored by the Qing Lan Project of Jiangsu Province (2020).