The Tetracentron genome provides insight into the early evolution of eudicots and the formation of vessel elements

Genome Biol. 2020 Dec 2;21(1):291. doi: 10.1186/s13059-020-02198-7.

Abstract

Background: Tetracentron sinense is an endemic and endangered deciduous tree. It belongs to the Trochodendrales, one of four early diverging lineages of eudicots known for having vesselless secondary wood. Sequencing and resequencing of the T. sinense genome will help us understand eudicot evolution, the genetic basis of tracheary element development, and the genetic diversity of this relict species.

Results: Here, we report a chromosome-scale assembly of the T. sinense genome. We assemble the 1.07 Gb genome sequence into 24 chromosomes and annotate 32,690 protein-coding genes. Phylogenomic analyses verify that the Trochodendrales and core eudicots are sister lineages and showed that two whole-genome duplications occurred in the Trochodendrales approximately 82 and 59 million years ago. Synteny analyses suggest that the γ event, resulting in paleohexaploidy, may have only happened in core eudicots. Interestingly, we find that vessel elements are present in T. sinense, which has two orthologs of AtVND7, the master regulator of vessel formation. T. sinense also has several key genes regulated by or regulating TsVND7.2 and their regulatory relationship resembles that in Arabidopsis thaliana. Resequencing and population genomics reveals high levels of genetic diversity of T. sinense and identifies four refugia in China.

Conclusions: The T. sinense genome provides a unique reference for inferring the early evolution of eudicots and the mechanisms underlying vessel element formation. Population genomics analysis of T. sinense reveals its genetic diversity and geographic structure with implications for conservation.

Keywords: Genetic diversity; Phylogenomic; Resequencing; Tetracentron sinense; VND7; Vessel; Whole genome duplication.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Arabidopsis / genetics
  • Base Sequence
  • China
  • Evolution, Molecular*
  • Genetic Variation
  • Genome*
  • Genome, Plant*
  • Magnoliopsida / genetics*
  • Phylogeny
  • Plant Proteins / genetics
  • Sequence Analysis
  • Synteny
  • Transcription Factors / genetics
  • Xylem

Substances

  • Plant Proteins
  • Transcription Factors