Gene length and expression level shape genomic novelties

Genome Res. 2014 Sep;24(9):1497-503. doi: 10.1101/gr.169722.113. Epub 2014 Jul 11.

Abstract

Gene duplication and alternative splicing are important mechanisms in the production of genomic novelties. Previous work has shown that a gene's family size and the number of splice variants it produces are inversely related, although the underlying reason is not well understood. Here, we report that gene length and expression level together explain this relationship. We found that gene lengths correlate with both gene duplication and alternative splicing: Longer genes are less likely to produce duplicates and more likely to exhibit alternative splicing. We show that gene length is a dynamic property, increasing with evolutionary time--due in part to the insertions of transposable elements--and decreasing following partial gene duplications. However, gene length alone does not account for the relationship between alternative splicing and gene duplication. A gene's expression level appears both to impose a strong constraint on its length and to restrict gene duplications. Furthermore, high gene expression promotes alternative splicing, in particular for long genes, and alternatively, short genes with low expression levels have large gene families. Our analysis of the human and mouse genomes shows that gene length and expression level are primary genic properties that together account for the relationship between gene duplication and alternative splicing and bias the origin of novelties in the genome.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alternative Splicing*
  • Animals
  • DNA Transposable Elements / genetics
  • Evolution, Molecular*
  • Gene Duplication*
  • Genome, Human*
  • Humans
  • Mice
  • Organ Specificity
  • Transcription, Genetic*

Substances

  • DNA Transposable Elements