Expression pattern and, surprisingly, gene length shape codon usage in Caenorhabditis, Drosophila, and Arabidopsis

Proc Natl Acad Sci U S A. 1999 Apr 13;96(8):4482-7. doi: 10.1073/pnas.96.8.4482.

Abstract

We measured the expression pattern and analyzed codon usage in 8,133, 1,550, and 2,917 genes, respectively, from Caenorhabditis elegans, Drosophila melanogaster, and Arabidopsis thaliana. In those three species, we observed a clear correlation between codon usage and gene expression levels and showed that this correlation is not due to a mutational bias. This provides direct evidence for selection on silent sites in those three distantly related multicellular eukaryotes. Surprisingly, there is a strong negative correlation between codon usage and protein length. This effect is not due to a smaller size of highly expressed proteins. Thus, for a same-expression pattern, the selective pressure on codon usage appears to be lower in genes encoding long rather than short proteins. This puzzling observation is not predicted by any of the current models of selection on codon usage and thus raises the question of how translation efficiency affects fitness in multicellular organisms.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Arabidopsis / genetics*
  • Biological Evolution*
  • Caenorhabditis elegans / embryology
  • Caenorhabditis elegans / genetics*
  • Caenorhabditis elegans / growth & development
  • Codon / genetics*
  • Drosophila melanogaster / genetics*
  • Expressed Sequence Tags
  • Gene Expression Regulation*
  • Gene Expression Regulation, Developmental
  • Gene Expression Regulation, Plant
  • Mutation
  • RNA, Messenger / analysis
  • Selection, Genetic

Substances

  • Codon
  • RNA, Messenger