A centromere map based on super pan-genome highlights the structure and function of rice centromeres

J Integr Plant Biol. 2024 Feb;66(2):196-207. doi: 10.1111/jipb.13607. Epub 2024 Jan 29.

Abstract

Rice (Oryza sativa) is a significant crop worldwide with a genome shaped by various evolutionary factors. Rice centromeres are crucial for chromosome segregation, and contain some unreported genes. Due to the diverse and complex centromere region, a comprehensive understanding of rice centromere structure and function at the population level is needed. We constructed a high-quality centromere map based on the rice super pan-genome consisting of a 251-accession panel comprising both cultivated and wild species of Asian and African rice. We showed that rice centromeres have diverse satellite repeat CentO, which vary across chromosomes and subpopulations, reflecting their distinct evolutionary patterns. We also revealed that long terminal repeats (LTRs), especially young Gypsy-type LTRs, are abundant in the peripheral CentO-enriched regions and drive rice centromere expansion and evolution. Furthermore, high-quality genome assembly and complete telomere-to-telomere (T2T) reference genome enable us to obtain more centromeric genome information despite mapping and cloning of centromere genes being challenging. We investigated the association between structural variations and gene expression in the rice centromere. A centromere gene, OsMAB, which positively regulates rice tiller number, was further confirmed by expression quantitative trait loci, haplotype analysis and clustered regularly interspaced palindromic repeats (CRISPR)/CRISPR-associated protein 9 methods. By revealing the new insights into the evolutionary patterns and biological roles of rice centromeres, our finding will facilitate future research on centromere biology and crop improvement.

Keywords: CentO satellite repeat; centromere; rice; super pan-genome.

MeSH terms

  • Base Sequence
  • Centromere / genetics
  • DNA, Satellite* / metabolism
  • Genome, Plant / genetics
  • Oryza* / genetics
  • Oryza* / metabolism

Substances

  • DNA, Satellite