Predicting functional alternative splicing by measuring RNA selection pressure from multigenome alignments

PLoS Comput Biol. 2009 Dec;5(12):e1000608. doi: 10.1371/journal.pcbi.1000608. Epub 2009 Dec 18.

Abstract

High-throughput methods such as EST sequencing, microarrays and deep sequencing have identified large numbers of alternative splicing (AS) events, but studies have shown that only a subset of these may be functional. Here we report a sensitive bioinformatics approach that identifies exons with evidence of a strong RNA selection pressure ratio (RSPR), i.e., evolutionary selection against mutations that change only the mRNA sequence while leaving the protein sequence unchanged. RSPR is measured across an entire evolutionary family, which greatly amplifies its predictive power. Using the UCSC 28-vertebrate genome alignment, this approach correctly predicted half to three-quarters of AS exons that are known binding targets of the NOVA splicing regulatory factor, and predicted 345 strongly selected alternative splicing events in human and 262 in mouse. These predictions were strongly validated by several experimental criteria of functional AS, such as independent detection of the same AS event in other species, reading-frame preservation, and experimental evidence of tissue-specific regulation: 75% (15/20) of a sample of high-RSPR exons displayed tissue-specific regulation in a panel of ten tissues, vs. only 20% (4/20) of a sample of low-RSPR exons. These data suggest that RSPR can identify exons with functionally important splicing regulation, and they provide biologists with a dataset of over 600 such exons. We present several case studies, including both well-studied examples (GRIN1) and novel examples (EXOC7). These data also show that RSPR strongly outperforms other approaches such as standard sequence conservation (which fails to distinguish amino acid selection pressure from RNA selection pressure) and pairwise genome comparison (which lacks adequate statistical power for predicting individual exons).
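
The tissue-specificity comparison reported above (15/20 high-RSPR exons vs. 4/20 low-RSPR exons) can be checked with a small worked calculation. The sketch below is illustrative only: the abstract does not say which statistical test, if any, the authors applied to these counts, so the choice of a one-sided Fisher's exact test and the function names `fisher_one_sided` and `odds_ratio` are assumptions made here.

```python
from math import comb


def fisher_one_sided(a: int, b: int, c: int, d: int) -> float:
    """One-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Returns P(X >= a) under the hypergeometric null, i.e. the probability
    of seeing at least `a` positives in row 1 if row membership and
    outcome were independent. (Hypothetical helper, not from the paper.)
    """
    row1 = a + b              # size of the first sample (high-RSPR exons)
    col1 = a + c              # total positives (tissue-specific exons)
    n = a + b + c + d         # all exons tested
    upper = min(row1, col1)   # largest possible value of the top-left cell
    tail = sum(
        comb(col1, k) * comb(n - col1, row1 - k)
        for k in range(a, upper + 1)
    )
    return tail / comb(n, row1)


def odds_ratio(a: int, b: int, c: int, d: int) -> float:
    """Sample odds ratio (a*d)/(b*c) for the same table."""
    return (a * d) / (b * c)


# Counts taken from the abstract:
#   high-RSPR: 15 tissue-specific, 5 not; low-RSPR: 4 tissue-specific, 16 not.
high_yes, high_no = 15, 20 - 15
low_yes, low_no = 4, 20 - 4

p_value = fisher_one_sided(high_yes, high_no, low_yes, low_no)
ratio = odds_ratio(high_yes, high_no, low_yes, low_no)

print(f"odds ratio = {ratio:.1f}")
print(f"one-sided Fisher p-value = {p_value:.2e}")
```

With only 20 exons per group the test is exact rather than asymptotic, which is why a hypergeometric tail is used here instead of a chi-square approximation.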

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Alternative Splicing*
  • Animals
  • Carrier Proteins / genetics
  • Computational Biology / methods*
  • Expressed Sequence Tags
  • Genome
  • Humans
  • Mice
  • Models, Genetic*
  • Nerve Tissue Proteins / genetics
  • Oligonucleotide Array Sequence Analysis
  • RNA / genetics*
  • Receptors, N-Methyl-D-Aspartate / genetics
  • Reproducibility of Results
  • Reverse Transcriptase Polymerase Chain Reaction
  • Sequence Alignment / methods*
  • Vesicular Transport Proteins / genetics

Substances

  • Carrier Proteins
  • Exoc7 protein, mouse
  • GRIN1 protein, human
  • Gprin1 protein, mouse
  • Nerve Tissue Proteins
  • Receptors, N-Methyl-D-Aspartate
  • Vesicular Transport Proteins
  • RNA