Large haplotypes highlight a complex age structure within the maize pan-genome

Genome Res. 2023 Mar;33(3):359-370. doi: 10.1101/gr.276705.122. Epub 2023 Feb 28.

Abstract

The genomes of maize and other eukaryotes contain stable haplotypes in regions of low recombination. These regions, including centromeres, long heterochromatic blocks, and rDNA arrays, have been difficult to analyze with respect to their diversity and origin. Greatly improved genome assemblies are now available that enable comparative genomics over these and other nongenic spaces. Using 26 complete maize genomes, we developed methods to align intergenic sequences while excluding genes and regulatory regions. The centromere haplotypes (cenhaps) extend for megabases on either side of the functional centromere regions and appear as evolutionary strata, with haplotype divergence/coalescence times dating as far back as 450 thousand years ago (kya). Application of the same methods to other low recombination regions (heterochromatic knobs and rDNA) and all intergenic spaces revealed that deep coalescence times are ubiquitous across the maize pan-genome. Divergence estimates vary over a broad timescale with peaks at ∼16 and 300 kya, reflecting a complex history of gene flow among diverging populations and changes in population size associated with domestication. Cenhaps and other long haplotypes provide vivid displays of this ancient diversity.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Centromere* / genetics
  • Genome, Plant
  • Genomics / methods
  • Haplotypes
  • Zea mays* / genetics