SBSA: an online service for somatic binding sequence annotation

Nucleic Acids Res. 2022 Jan 11;50(1):e4. doi: 10.1093/nar/gkab877.

Abstract

Efficient annotation of alterations in binding sequences of molecular regulators can help identify novel candidates for mechanisms study and offer original therapeutic hypotheses. In this work, we developed Somatic Binding Sequence Annotator (SBSA) as a full-capacity online tool to annotate altered binding motifs/sequences, addressing diverse types of genomic variants and molecular regulators. The genomic variants can be somatic mutation, single nucleotide polymorphism, RNA editing, etc. The binding motifs/sequences involve transcription factors (TFs), RNA-binding proteins, miRNA seeds, miRNA-mRNA 3'-UTR binding target, or can be any custom motifs/sequences. Compared to similar tools, SBSA is the first to support miRNA seeds and miRNA-mRNA 3'-UTR binding target, and it unprecedentedly implements a personalized genome approach that accommodates joint adjacent variants. SBSA is empowered to support an indefinite species, including preloaded reference genomes for SARS-Cov-2 and 25 other common organisms. We demonstrated SBSA by annotating multi-omics data from over 30,890 human subjects. Of the millions of somatic binding sequences identified, many are with known severe biological repercussions, such as the somatic mutation in TERT promoter region which causes a gained binding sequence for E26 transformation-specific factor (ETS1). We further validated the function of this TERT mutation using experimental data in cancer cells. Availability:http://innovebioinfo.com/Annotation/SBSA/SBSA.php.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • 3' Untranslated Regions
  • Algorithms
  • Amino Acid Motifs
  • COVID-19 / metabolism
  • COVID-19 / virology*
  • Computational Biology / instrumentation*
  • Computational Biology / methods
  • Computers
  • Genetic Techniques
  • Genome, Human
  • Genomics / instrumentation*
  • Genomics / methods
  • Humans
  • Internet
  • MicroRNAs / metabolism
  • Mutation*
  • Phenotype
  • Promoter Regions, Genetic
  • Protein Binding
  • Proteomics / instrumentation*
  • Proteomics / methods
  • Proto-Oncogene Protein c-ets-1 / genetics
  • Proto-Oncogene Protein c-ets-1 / metabolism
  • RNA-Binding Proteins / metabolism
  • SARS-CoV-2*
  • Telomerase / metabolism

Substances

  • 3' Untranslated Regions
  • ETS1 protein, human
  • MicroRNAs
  • Proto-Oncogene Protein c-ets-1
  • RNA-Binding Proteins
  • TERT protein, human
  • Telomerase