A systematic assessment of the impact of rare canonical splice site variants on splicing using functional and in silico methods

Rachel Y Oh; Ali Almail; David Cheerie; George Guirguis; Huayun Hou; Kyoko E Yuki; Bushra Haque; Bhooma Thiruvahindrapuram; Christian R Marshall; Roberto Mendoza-Londono; Adam Shlien; Lianna G Kyriakopoulou; Susan Walker; James J Dowling; Michael D Wilson; Gregory Costain

doi:10.1016/j.xhgg.2024.100299

A systematic assessment of the impact of rare canonical splice site variants on splicing using functional and in silico methods

HGG Adv. 2024 Apr 23:100299. doi: 10.1016/j.xhgg.2024.100299. Online ahead of print.

Authors

Rachel Y Oh¹, Ali Almail², David Cheerie³, George Guirguis³, Huayun Hou⁴, Kyoko E Yuki⁵, Bushra Haque³, Bhooma Thiruvahindrapuram⁶, Christian R Marshall⁷, Roberto Mendoza-Londono⁸, Adam Shlien⁹, Lianna G Kyriakopoulou⁷, Susan Walker⁶, James J Dowling¹⁰, Michael D Wilson³, Gregory Costain¹¹

Affiliations

¹ Division of Clinical and Metabolic Genetics, Hospital for Sick Children, Toronto, Canada; Temerty Faculty of Medicine, University of Toronto, Toronto, Canada.
² Temerty Faculty of Medicine, University of Toronto, Toronto, Canada; Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, Canada.
³ Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, Canada; Department of Molecular Genetics, University of Toronto, Toronto, Canada.
⁴ Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, Canada.
⁵ Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, Canada; Division of Genome Diagnostics, Hospital for Sick Children, Toronto, Canada.
⁶ The Centre for Applied Genomics, SickKids Research Institute, Toronto, Canada.
⁷ Division of Genome Diagnostics, Hospital for Sick Children, Toronto, Canada; Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Canada.
⁸ Division of Clinical and Metabolic Genetics, Hospital for Sick Children, Toronto, Canada; Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, Canada; Department of Paediatrics, University of Toronto, Toronto, Canada.
⁹ Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, Canada; Department of Molecular Genetics, University of Toronto, Toronto, Canada; Division of Genome Diagnostics, Hospital for Sick Children, Toronto, Canada; Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Canada.
¹⁰ Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, Canada; Department of Molecular Genetics, University of Toronto, Toronto, Canada; Department of Paediatrics, University of Toronto, Toronto, Canada; Division of Neurology, Hospital for Sick Children, Toronto, Canada.
¹¹ Division of Clinical and Metabolic Genetics, Hospital for Sick Children, Toronto, Canada; Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, Canada; Department of Molecular Genetics, University of Toronto, Toronto, Canada; Department of Paediatrics, University of Toronto, Toronto, Canada. Electronic address: gregory.costain@sickkids.ca.

PMID: 38659227
DOI: 10.1016/j.xhgg.2024.100299

Abstract

Background/objectives: Canonical splice site variants (CSSVs) are often presumed to cause loss-of-function (LoF) and are assigned very strong evidence of pathogenicity (according to ACMG criterion PVS1). The exact nature and predictability of splicing effects of unselected rare CSSVs in blood-expressed genes is poorly understood.

Methods: 168 rare CSSVs in unselected blood-expressed genes were identified by genome sequencing in 112 individuals, and their impact on splicing was interrogated manually in RNA sequencing (RNA-seq) data. Blind to these RNA-seq data, we attempted to predict the precise impact of CSSVs by applying in silico tools and the ClinGen Sequence Variant Interpretation Working Group 2018 guidelines for applying PVS1 criterion.

Results: There was no evidence of a frameshift nor of reduced expression consistent with nonsense-mediated decay for 25.6% of CSSVs: 17.9% had wildtype splicing only and normal junction depths, 3.6% resulted in cryptic splice site usage and in-frame indels, 3.6% resulted in full exon skipping (in-frame), and 0.6% resulted in full intron inclusion (in-frame). The predicted impact on splicing using (i) SpliceAI, (ii) MaxEntScan, and (iii) AutoPVS1, an automatic classification tool for PVS1 interpretation of null variants that utilizes Ensembl Variant Effect Predictor and MaxEntScan, was concordant with RNA-seq analyses for 65%, 63% and 61% of CSSVs, respectively.

Conclusion: Approximately 1 in 4 rare CSSVs may not cause LoF based on analysis of RNA-seq data. Predictions from in silico methods were often discordant with findings from RNA-seq. More caution may be warranted in applying PVS1-level evidence to CSSVs in the absence of functional data.