Missense variants causing Wiedemann-Steiner syndrome preferentially occur in the KMT2A-CXXC domain and are accurately classified using AlphaFold2

PLoS Genet. 2022 Jun 21;18(6):e1010278. doi: 10.1371/journal.pgen.1010278. eCollection 2022 Jun.

Abstract

Wiedemann-Steiner syndrome (WDSTS) is a neurodevelopmental disorder caused by de novo variants in KMT2A, which encodes a multi-domain histone methyltransferase. To gain insight into the currently unknown pathogenesis of WDSTS, we examined the spatial distribution of likely WDSTS-causing variants across the 15 different domains of KMT2A. Compared to variants in healthy controls, WDSTS variants exhibit a 61.9-fold overrepresentation within the CXXC domain-which mediates binding to unmethylated CpGs-suggesting a major role for this domain in mediating the phenotype. In contrast, we find no significant overrepresentation within the catalytic SET domain. Corroborating these results, we find that hippocampal neurons from Kmt2a-deficient mice demonstrate disrupted histone methylation (H3K4me1 and H3K4me3) preferentially at CpG-rich regions, but this has no systematic impact on gene expression. Motivated by these results, we combine accurate prediction of the CXXC domain structure by AlphaFold2 with prior biological knowledge to develop a classification scheme for missense variants in the CXXC domain. Our classifier achieved 92.6% positive and 92.9% negative predictive value on a hold-out test set. This classification performance enabled us to subsequently perform an in silico saturation mutagenesis and classify a total of 445 variants according to their functional effects. Our results yield a novel insight into the mechanistic basis of WDSTS and provide an example of how AlphaFold2 can contribute to the in silico characterization of variant effects with very high accuracy, suggesting a paradigm potentially applicable to many other Mendelian disorders.

MeSH terms

  • Abnormalities, Multiple* / genetics
  • Animals
  • Craniofacial Abnormalities
  • Growth Disorders* / genetics
  • Histone-Lysine N-Methyltransferase* / genetics
  • Hypertrichosis* / genetics
  • Intellectual Disability* / genetics
  • Mice
  • Mutation, Missense
  • Myeloid-Lymphoid Leukemia Protein* / genetics
  • Protein Domains
  • Protein Folding
  • Syndrome

Substances

  • Myeloid-Lymphoid Leukemia Protein
  • Histone-Lysine N-Methyltransferase
  • Kmt2a protein, mouse

Supplementary concepts

  • Wiedemann Grosse Dibbern syndrome