Major repeat components covering one-third of the ginseng (Panax ginseng C.A. Meyer) genome and evidence for allotetraploidy

Plant J. 2014 Mar;77(6):906-16. doi: 10.1111/tpj.12441. Epub 2014 Feb 24.

Abstract

Ginseng (Panax ginseng) is a famous medicinal herb, but the composition and structure of its genome are largely unknown. Here we characterized the major repeat components and inspected their distribution in the ginseng genome. By analyzing three repeat-rich bacterial artificial chromosome (BAC) sequences from ginseng, we identified complex insertion patterns of 34 long terminal repeat retrotransposons (LTR-RTs) and 11 LTR-RT derivatives accounting for more than 80% of the BAC sequences. The LTR-RTs were classified into three Ty3/gypsy (PgDel, PgTat and PgAthila) and two Ty1/Copia (PgTork and PgOryco) families. Mapping of 30-Gbp Illumina whole-genome shotgun reads to the BAC sequences revealed that these five LTR-RT families occupy at least 34% of the ginseng genome. The Ty3/Gypsy families were predominant, comprising 74 and 33% of the BAC sequences and the genome, respectively. In particular, the PgDel family accounted for 29% of the genome and presumably played major roles in enlargement of the size of the ginseng genome. Fluorescence in situ hybridization (FISH) revealed that the PgDel1 elements are distributed throughout the chromosomes along dispersed heterochromatic regions except for ribosomal DNA blocks. The intensity of the PgDel2 FISH signals was biased toward 24 out of 48 chromosomes. Unique gene probes showed two pairs of signals with different locations, one pair in subtelomeric regions on PgDel2-rich chromosomes and the other in interstitial regions on PgDel2-poor chromosomes, demonstrating allotetraploidy in ginseng. Our findings promote understanding of the evolution of the ginseng genome and of that of related species in the Araliaceae.

Keywords: Panax ginseng; allotetraploidy; genome evolution; heterochromatin; long terminal repeat retrotransposon.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Base Sequence
  • Chromosomes, Artificial, Bacterial
  • Chromosomes, Plant / genetics*
  • DNA, Plant / genetics
  • Evolution, Molecular
  • Genome, Plant / genetics*
  • Heterochromatin
  • In Situ Hybridization, Fluorescence
  • Models, Genetic
  • Molecular Sequence Data
  • Panax / cytology
  • Panax / genetics*
  • Phylogeny
  • Retroelements / genetics*
  • Sequence Analysis, DNA
  • Terminal Repeat Sequences / genetics*
  • Tetraploidy

Substances

  • DNA, Plant
  • Heterochromatin
  • Retroelements

Associated data

  • GENBANK/KF357942
  • GENBANK/KF357943
  • GENBANK/KF357944