From comparative gene content and gene order to ancestral contigs, chromosomes and karyotypes

Sci Rep. 2023 Apr 13;13(1):6095. doi: 10.1038/s41598-023-33029-x.

Abstract

To reconstruct the ancestral genome of a set of phylogenetically related descendant species, we use the RACCROCHE pipeline for organizing a large number of generalized gene adjacencies into contigs and then into chromosomes. Separate reconstructions are carried out for each ancestral node of the phylogenetic tree for focal taxa. The ancestral reconstructions are monoploids; they each contain at most one member of each gene family constructed from descendants, ordered along the chromosomes. We design and implement a new computational technique for solving the problem of estimating the ancestral monoploid number of chromosomes x. This involves a "g-mer" analysis to resolve a bias due long contigs, and gap statistics to estimate x. We find that the monoploid number of all the rosid and asterid orders is [Formula: see text]. We show that this is not an artifact of our method by deriving [Formula: see text] for the metazoan ancestor.

MeSH terms

  • Animals
  • Chromosomes* / genetics
  • Evolution, Molecular*
  • Gene Order
  • Genome
  • Karyotype
  • Phylogeny