Classification and deep-learning-based prediction of Alzheimer disease subtypes by using genomic data

Transl Psychiatry. 2023 Jun 29;13(1):232. doi: 10.1038/s41398-023-02531-1.

Abstract

Late-onset Alzheimer's disease (LOAD) is the most common multifactorial neurodegenerative disease among elderly people. LOAD is heterogeneous, and the symptoms vary among patients. Genome-wide association studies (GWAS) have identified genetic risk factors for LOAD but not for LOAD subtypes. Here, we examined the genetic architecture of LOAD based on Japanese GWAS data from 1947 patients and 2192 cognitively normal controls in a discovery cohort and 847 patients and 2298 controls in an independent validation cohort. Two distinct groups of LOAD patients were identified. One was characterized by major risk genes for developing LOAD (APOC1 and APOC1P1) and immune-related genes (RELB and CBLC). The other was characterized by genes associated with kidney disorders (AXDND1, FBP1, and MIR2278). Subsequent analysis of albumin and hemoglobin values from routine blood test results suggested that impaired kidney function could lead to LOAD pathogenesis. We developed a prediction model for LOAD subtypes using a deep neural network, which achieved an accuracy of 0.694 (2870/4137) in the discovery cohort and 0.687 (2162/3145) in the validation cohort. These findings provide new insights into the pathogenic mechanisms of LOAD.
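
The reported accuracies can be checked directly from the counts given in the abstract. The following is a minimal sketch of that arithmetic only, not the authors' evaluation code; the names `reported`, `cohort_sizes`, and `accuracy` are illustrative. Note that the discovery denominator (4137) is two fewer than the stated discovery cohort total (1947 + 2192 = 4139). The abstract does not explain the difference, so the sketch uses the reported figures as they are.

```python
# Correct/total prediction counts, copied verbatim from the abstract.
reported = {
    "discovery": (2870, 4137),
    "validation": (2162, 3145),
}

# Cohort sizes (patients + controls), also from the abstract.
cohort_sizes = {
    "discovery": 1947 + 2192,
    "validation": 847 + 2298,
}


def accuracy(correct: int, total: int) -> float:
    """Fraction of samples classified correctly."""
    return correct / total


# Accuracy rounded to three decimals, as reported in the abstract.
accuracies = {
    name: round(accuracy(correct, total), 3)
    for name, (correct, total) in reported.items()
}

for name, acc in accuracies.items():
    evaluated = reported[name][1]
    print(f"{name}: accuracy={acc}, evaluated={evaluated}, cohort={cohort_sizes[name]}")
```

Both rounded values match the figures quoted above, and the validation denominator equals the full validation cohort size.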

MeSH terms

  • Aged
  • Alzheimer Disease* / genetics
  • Deep Learning*
  • Genome-Wide Association Study
  • Genomics
  • Humans
  • Neurodegenerative Diseases*