Structure and expression analysis of rice paleo duplications

Nucleic Acids Res. 2009 Mar;37(4):1248-59. doi: 10.1093/nar/gkn1048. Epub 2009 Jan 9.

Abstract

Having a well-known history of genome duplication, rice is a good model for studying structural and functional evolution of paleo duplications. Improved sequence alignment criteria were used to characterize 10 major chromosome-to-chromosome duplication relationships associated with 1440 paralogous pairs, covering 47.8% of the rice genome, with 12.6% of genes that are conserved within sister blocks. Using a micro-array experiment, a genome-wide expression map has been produced, in which 2382 genes show significant differences of expression in root, leaf and grain. By integrating both structural (1440 paralogous pairs) and functional information (2382 differentially expressed genes), we identified 115 paralogous gene pairs for which at least one copy is differentially expressed in one of the three tissues. A vast majority of the 115 paralogous gene pairs have been neofunctionalized or subfunctionalized as 88%, 89% and 96% of duplicates, respectively, expressed in grain, leaf and root show distinct expression patterns. On the basis of a Gene Ontology analysis, we have identified and characterized the gene families that have been structurally and functionally preferentially retained in the duplication showing that the vast majority (>85%) of duplicated have been either lost or have been subfunctionalized or neofunctionalized during 50-70 million years of evolution.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Evolution, Molecular*
  • Gene Duplication*
  • Gene Expression Profiling
  • Gene Expression Regulation, Plant
  • Genes, Plant*
  • Genome, Plant*
  • Genomics
  • Oligonucleotide Array Sequence Analysis
  • Oryza / genetics*
  • Oryza / metabolism
  • Polyploidy*
  • Sequence Alignment