Single-nucleus gene and gene set expression-based similarity network fusion identifies autism molecular subtypes

BMC Bioinformatics. 2023 Apr 11;24(1):142. doi: 10.1186/s12859-023-05278-0.

Abstract

Background: Autism spectrum disorder (ASD) is a complex neurodevelopmental disorder that is highly phenotypically and genetically heterogeneous. With the accumulation of biological sequencing data, more and more studies shift to molecular subtype-first approach, from identifying molecular subtypes based on genetic and molecular data to linking molecular subtypes with clinical manifestation, which can reduce heterogeneity before phenotypic profiling.

Results: In this study, we perform similarity network fusion to integrate gene and gene set expression data of multiple human brain cell types for ASD molecular subtype identification. Then we apply subtype-specific differential gene and gene set expression analyses to study expression patterns specific to molecular subtypes in each cell type. To demonstrate the biological and practical significance, we analyze the molecular subtypes, investigate their correlation with ASD clinical phenotype, and construct ASD molecular subtype prediction models.

Conclusions: The identified molecular subtype-specific gene and gene set expression may be used to differentiate ASD molecular subtypes, facilitating the diagnosis and treatment of ASD. Our method provides an analytical pipeline for the identification of molecular subtypes and even disease subtypes of complex disorders.

Keywords: Autism; Gene set; Molecular subtype; Similarity network fusion; Single-nucleus RNA-seq data.

MeSH terms

  • Autism Spectrum Disorder* / genetics
  • Autistic Disorder* / genetics
  • Brain / metabolism
  • Humans