Deep transcriptome sequencing provides new insights into the structural and functional organization of the wheat genome

Genome Biol. 2015 Feb 10;16(1):29. doi: 10.1186/s13059-015-0601-9.

Abstract

Background: Because of its size, allohexaploid nature, and high repeat content, the bread wheat genome is a good model to study the impact of genome structure on gene organization, function, and regulation. However, the lack of a reference genome sequence has long hampered such studies, and our knowledge of the wheat gene space is still limited. Access to the reference sequence of wheat chromosome 3B provided us with an opportunity to study the wheat transcriptome and its relationships to genome and gene structure at a level that has never been reached before.

Results: By combining this sequence with RNA-seq data, we construct a fine transcriptome map of chromosome 3B. More than 8,800 transcription sites are identified, distributed throughout the entire chromosome. Expression level, expression breadth, and alternative splicing, as well as several structural features of genes, including transcript length, number of exons, and cumulative intron length, are investigated. Our analysis reveals a non-monotonic relationship between gene expression and structure and leads to the hypothesis that gene structure is determined by its function, whereas gene expression is subject to energetic cost. Moreover, we observe a recombination-based partitioning at the gene structure and function level.

Conclusions: Our analysis provides new insights into the relationships between gene and genome structure and function. It reveals mechanisms conserved with other plant species as well as superimposed evolutionary forces that shaped the wheat gene space, likely participating in wheat adaptation.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alternative Splicing / genetics
  • Chromosomes, Plant / genetics
  • Gene Expression Regulation, Plant
  • Genes, Plant
  • Genome, Plant*
  • High-Throughput Nucleotide Sequencing / methods*
  • Multigene Family
  • Nucleic Acid Conformation
  • Transcription, Genetic
  • Transcriptome / genetics*
  • Triticum / genetics*