Introme accurately predicts the impact of coding and noncoding variants on gene splicing, with clinical applications

Genome Biol. 2023 May 17;24(1):118. doi: 10.1186/s13059-023-02936-7.

Abstract

Predicting the impact of coding and noncoding variants on splicing is challenging, particularly in non-canonical splice sites, leading to missed diagnoses in patients. Existing splice prediction tools are complementary but knowing which to use for each splicing context remains difficult. Here, we describe Introme, which uses machine learning to integrate predictions from several splice detection tools, additional splicing rules, and gene architecture features to comprehensively evaluate the likelihood of a variant impacting splicing. Through extensive benchmarking across 21,000 splice-altering variants, Introme outperformed all tools (auPRC: 0.98) for the detection of clinically significant splice variants. Introme is available at https://github.com/CCICB/introme .

Keywords: Clinical genetics; Deep intronic; Genomics; Intronic variant; Splice region; Splice site; Splicing; Splicing regulatory element; Variant interpretation.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Humans
  • Introns
  • Machine Learning
  • Mutation
  • RNA Splice Sites*
  • RNA Splicing*

Substances

  • RNA Splice Sites