Two gap-free reference genomes and a global view of the centromere architecture in rice

Mol Plant. 2021 Oct 4;14(10):1757-1767. doi: 10.1016/j.molp.2021.06.018. Epub 2021 Jun 24.

Abstract

Rice (Oryza sativa), a major staple throughout the world and a model system for plant genomics and breeding, was the first crop genome sequenced almost two decades ago. However, reference genomes for all higher organisms to date contain gaps and missing sequences. Here, we report the assembly and analysis of gap-free reference genome sequences for two elite O. sativa xian/indica rice varieties, Zhenshan 97 and Minghui 63, which are being used as a model system for studying heterosis and yield. Gap-free reference genomes provide the opportunity for a global view of the structure and function of centromeres. We show that all rice centromeric regions share conserved centromere-specific satellite motifs with different copy numbers and structures. In addition, the similarity of CentO repeats in the same chromosome is higher than across chromosomes, supporting a model of local expansion and homogenization. Both genomes have over 395 non-TE genes located in centromere regions, of which ∼41% are actively transcribed. Two large structural variants at the end of chromosome 11 affect the copy number of resistance genes between the two genomes. The availability of the two gap-free genomes lays a solid foundation for further understanding genome structure and function in plants and breeding climate-resilient varieties.

Keywords: MH63; ZS97; centromere architecture; hybrid rice; rice genome.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Centromere*
  • Chromosomes, Plant*
  • Genome, Plant*
  • Molecular Sequence Annotation
  • Oryza / genetics*
  • Species Specificity
  • Whole Genome Sequencing