New insights into the wheat chromosome 4D structure and virtual gene order, revealed by survey pyrosequencing

Plant Sci. 2015 Apr:233:200-212. doi: 10.1016/j.plantsci.2014.12.004. Epub 2014 Dec 18.

Abstract

Survey sequencing of the bread wheat (Triticum aestivum L.) genome (AABBDD) has been approached through different strategies delivering important information. However, the current wheat sequence knowledge is not complete. The aim of our study is to provide different and complementary set of data for chromosome 4D. A survey sequence was obtained by pyrosequencing of flow-sorted 4DS (7.2×) and 4DL (4.1×) arms. Single ends (SE) and long mate pairs (LMP) reads were assembled into contigs (223Mb) and scaffolds (65Mb) that were aligned to Aegilops tauschii draft genome (DD), anchoring 34Mb to chromosome 4. Scaffolds annotation rendered 822 gene models. A virtual gene order comprising 1973 wheat orthologous gene loci and 381 wheat gene models was built. This order was largely consistent with the scaffold order determined based on a published high density map from the Ae. tauschii chromosome 4, using bin-mapped 4D ESTs as a common reference. The virtual order showed a higher collinearity with homeologous 4B compared to 4A. Additionally, a virtual map was constructed and ∼5700 genes (∼2200 on 4DS and ∼3500 on 4DL) predicted. The sequence and virtual order obtained here using the 454 platform were compared with the Illumina one used by the IWGSC, giving complementary information.

Keywords: Chromosome 4D survey sequence; Gene annotation; Gene content; Synteny; Triticum aestivum; Virtual gene order.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Chromosome Mapping
  • Chromosomes, Plant*
  • Expressed Sequence Tags / chemistry
  • Gene Order*
  • High-Throughput Nucleotide Sequencing
  • Molecular Sequence Data
  • Sequence Analysis, DNA
  • Triticum / genetics*

Associated data

  • GENBANK/JROL00000000