Automated classification of alternative splicing and transcriptional initiation and construction of visual database of classified patterns

Bioinformatics. 2006 May 15;22(10):1211-6. doi: 10.1093/bioinformatics/btl067. Epub 2006 Feb 24.

Abstract

Motivation: Large-scale detection and classification of alternative splicing and transcriptional initiation (ASTI) is the first step towards detailed studies of the functional implications and mechanisms of these phenomena.

Results: We have developed an algorithm that classifies all observed units of ASTI into an extendable set of distinct types (e.g. cassette type) by converting a collection of alignments between a genomic DNA sequence and cDNA sequences into a binary description. This description system can uniquely and compactly encode not only typical patterns but also rare patterns that are usually lumped together as 'others.' More than 150 distinct ASTI types were found when this system was applied to genome-wide detection of ASTI units in humans and five other eukaryotes.
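
The abstract does not spell out the encoding, so the following is only a minimal sketch of one way a pair of aligned transcripts could be turned into a binary description. It assumes exons are given as half-open genomic intervals on the plus strand, and that the input is already restricted to a single ASTI unit. The function names (`binary_patterns`, `classify`) and the lookup table `KNOWN_TYPES` are hypothetical and are not taken from the paper or from ASTRA.

```python
from typing import List, Tuple

# Exons as half-open [start, end) genomic intervals (assumed convention).
Exons = List[Tuple[int, int]]


def binary_patterns(a: Exons, b: Exons) -> Tuple[str, str]:
    """Encode two transcripts as bit strings over shared genomic segments."""
    # Every exon boundary from either transcript splits the locus
    # into elementary segments.
    bounds = sorted({p for s, e in a + b for p in (s, e)})
    segments = list(zip(bounds, bounds[1:]))

    def encode(exons: Exons) -> str:
        # A segment is 1 if it lies inside an exon, otherwise 0.
        return "".join(
            "1" if any(s <= lo and hi <= e for s, e in exons) else "0"
            for lo, hi in segments
        )

    return encode(a), encode(b)


# Hypothetical table of a few familiar types, keyed by the unordered
# pair of bit strings. Illustrative only, not the paper's catalogue.
KNOWN_TYPES = {
    frozenset({"10101", "10001"}): "cassette",
    frozenset({"101", "111"}): "retained intron",
}


def classify(a: Exons, b: Exons) -> str:
    """Name the pattern if known; otherwise use the code itself as the type."""
    key = frozenset(binary_patterns(a, b))
    # Unlisted patterns keep a distinct label instead of falling into 'others'.
    return KNOWN_TYPES.get(key, "type:" + "|".join(sorted(key)))


if __name__ == "__main__":
    with_exon = [(0, 100), (200, 300), (400, 500)]
    skipped = [(0, 100), (400, 500)]
    print(binary_patterns(with_exon, skipped))
    print(classify(with_exon, skipped))
```

In this sketch, a pattern missing from the table is labeled by its own bit strings, which mirrors the abstract's point that rare patterns can receive a unique, compact code rather than being pooled as 'others'.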

Availability: The data detected by this system are available through ASTRA (http://alterna.cbrc.jp/), a database equipped with a Java-based browser that can interactively reorganize the order of displayed splicing patterns on demand.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Alternative Splicing / genetics*
  • Artificial Intelligence
  • Base Sequence
  • Computer Graphics
  • Databases, Genetic*
  • Molecular Sequence Data
  • Pattern Recognition, Automated / methods
  • Sequence Alignment / methods*
  • Sequence Analysis, DNA / methods*
  • Transcription Initiation Site*
  • User-Computer Interface*