Annotating high-impact 5'untranslated region variants with the UTRannotator

Bioinformatics. 2021 May 23;37(8):1171-1173. doi: 10.1093/bioinformatics/btaa783.

Abstract

Summary: Current tools to annotate the predicted effect of genetic variants are heavily biased towards protein-coding sequence. Variants outside of these regions may have a large impact on protein expression and/or structure and can lead to disease, but this effect can be challenging to predict. Consequently, these variants are poorly annotated using standard tools. We have developed a plugin to the Ensembl Variant Effect Predictor, the UTRannotator, that annotates variants in 5'untranslated regions (5'UTR) that create or disrupt upstream open reading frames. We investigate the utility of this tool using the ClinVar database, providing an annotation for 31.9% of all 5'UTR (likely) pathogenic variants, and highlighting 31 variants of uncertain significance as candidates for further follow-up. We will continue to update the UTRannotator as we gain new knowledge on the impact of variants in UTRs.

Availability and implementation: UTRannotator is freely available on Github: https://github.com/ImperialCardioGenetics/UTRannotator.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • 5' Untranslated Regions* / genetics
  • Humans
  • Molecular Sequence Annotation
  • Open Reading Frames / genetics
  • Software*

Substances

  • 5' Untranslated Regions