Extensive sequence divergence between the reference genomes of two elite indica rice varieties Zhenshan 97 and Minghui 63

Proc Natl Acad Sci U S A. 2016 Aug 30;113(35):E5163-71. doi: 10.1073/pnas.1611012113. Epub 2016 Aug 17.

Abstract

Asian cultivated rice consists of two subspecies: Oryza sativa subsp. indica and O. sativa subsp. japonica Despite the fact that indica rice accounts for over 70% of total rice production worldwide and is genetically much more diverse, a high-quality reference genome for indica rice has yet to be published. We conducted map-based sequencing of two indica rice lines, Zhenshan 97 (ZS97) and Minghui 63 (MH63), which represent the two major varietal groups of the indica subspecies and are the parents of an elite Chinese hybrid. The genome sequences were assembled into 237 (ZS97) and 181 (MH63) contigs, with an accuracy >99.99%, and covered 90.6% and 93.2% of their estimated genome sizes. Comparative analyses of these two indica genomes uncovered surprising structural differences, especially with respect to inversions, translocations, presence/absence variations, and segmental duplications. Approximately 42% of nontransposable element related genes were identical between the two genomes. Transcriptome analysis of three tissues showed that 1,059-2,217 more genes were expressed in the hybrid than in the parents and that the expressed genes in the hybrid were much more diverse due to their divergence between the parental genomes. The public availability of two high-quality reference genomes for the indica subspecies of rice will have large-ranging implications for plant biology and crop genetic improvement.

Keywords: BAC-by-BAC strategy; Oryza sativa; reference genomes; transcriptome.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Chromosome Mapping / methods
  • Chromosomes, Plant / genetics*
  • Gene Expression Profiling
  • Gene Expression Regulation, Plant
  • Genes, Plant / genetics
  • Genetic Variation*
  • Genome, Plant / genetics*
  • INDEL Mutation
  • Oryza / classification
  • Oryza / genetics*
  • Polymorphism, Single Nucleotide
  • Species Specificity

Associated data

  • GENBANK/LNNJ00000000
  • GENBANK/LNNK00000000