A systematic assessment of the impact of rare canonical splice site variants on splicing using functional and in silico methods

HGG Adv. 2024 Apr 23:100299. doi: 10.1016/j.xhgg.2024.100299. Online ahead of print.

Abstract

Background/objectives: Canonical splice site variants (CSSVs) are often presumed to cause loss-of-function (LoF) and are assigned very strong evidence of pathogenicity (according to ACMG criterion PVS1). The exact nature and predictability of splicing effects of unselected rare CSSVs in blood-expressed genes is poorly understood.

Methods: 168 rare CSSVs in unselected blood-expressed genes were identified by genome sequencing in 112 individuals, and their impact on splicing was interrogated manually in RNA sequencing (RNA-seq) data. Blind to these RNA-seq data, we attempted to predict the precise impact of CSSVs by applying in silico tools and the ClinGen Sequence Variant Interpretation Working Group 2018 guidelines for applying PVS1 criterion.

Results: There was no evidence of a frameshift nor of reduced expression consistent with nonsense-mediated decay for 25.6% of CSSVs: 17.9% had wildtype splicing only and normal junction depths, 3.6% resulted in cryptic splice site usage and in-frame indels, 3.6% resulted in full exon skipping (in-frame), and 0.6% resulted in full intron inclusion (in-frame). The predicted impact on splicing using (i) SpliceAI, (ii) MaxEntScan, and (iii) AutoPVS1, an automatic classification tool for PVS1 interpretation of null variants that utilizes Ensembl Variant Effect Predictor and MaxEntScan, was concordant with RNA-seq analyses for 65%, 63% and 61% of CSSVs, respectively.

Conclusion: Approximately 1 in 4 rare CSSVs may not cause LoF based on analysis of RNA-seq data. Predictions from in silico methods were often discordant with findings from RNA-seq. More caution may be warranted in applying PVS1-level evidence to CSSVs in the absence of functional data.