5' splice site GC>GT and GT>GC variants differ markedly in terms of their functionality and pathogenicity

Hum Mutat. 2020 Aug;41(8):1358-1364. doi: 10.1002/humu.24029. Epub 2020 May 12.

Abstract

In the human genome, most 5' splice sites (~99%) employ the canonical GT dinucleotide whereas a small minority (~1%) use the noncanonical GC dinucleotide. The functionality and pathogenicity of 5' splice site GT>GC (+2T>C) variants have been extensively studied but we know very little about 5' splice site GC>GT (+2C>T) variants. Herein, we have addressed this deficiency by performing a meta-analysis of reported +2C>T "pathogenic" variants together with a functional analysis of engineered +2C>T substitutions using a cell culture-based full-length gene splicing assay. Our results establish proof of concept that +2C>T variants are qualitatively different from +2T>C variants in terms of their functionality and suggest that, in sharp contrast to +2T>C variants, most if not all +2C>T variants have no pathological relevance. Our findings have important implications for interpreting the clinical relevance of +2C>T variants and understanding the evolutionary switching between GT and GC 5' splice sites in mammalian genomes.

Keywords: +2C>T variant; +2T>C variant; 5′ splice site; Human Gene Mutation Database; full-length gene splicing assay; noncanonical GC dinucleotide.

Publication types

  • Meta-Analysis
  • Research Support, Non-U.S. Gov't

MeSH terms

  • DNA Mutational Analysis
  • Dystrophin / genetics
  • Humans
  • Mutation*
  • RNA Splice Sites*
  • Thrombospondins / genetics

Substances

  • DMD protein, human
  • Dystrophin
  • RNA Splice Sites
  • THSD7B protein, human
  • Thrombospondins