Complete Chloroplast Genome of Hypericum perforatum and Dynamic Evolution in Hypericum (Hypericaceae)

Int J Mol Sci. 2023 Nov 9;24(22):16130. doi: 10.3390/ijms242216130.

Abstract

Hypericum perforatum (St. John's wort) is a medicinal plant in the family Hypericaceae. Here, we sequenced the complete chloroplast genome of H. perforatum and compared genome variation among five Hypericum species to characterize dynamic changes and elucidate the mechanisms underlying rearrangements in Hypericum chloroplast genomes. The H. perforatum chloroplast genome is 139,725 bp long and has a circular quadripartite structure, with two inverted repeat (IR) copies separating a large single-copy region from a small single-copy region. It encodes 106 unique genes, including 73 protein-coding genes, 29 tRNAs, and 4 rRNAs. Hypericum chloroplast genomes exhibit rearrangements and significant variation among species. Genome size variation among the five Hypericum species was closely associated with the expansion or contraction of the IR regions and with gene losses. Three genes (trnK-UUU, infA, and rps16) were lost, and three genes (rps7, rpl23, and rpl32) were pseudogenized in Hypericum. All the Hypericum chloroplast genomes lost the two introns in clpP, the intron in rps12, and the second intron in ycf3. Hypericum chloroplast genomes contain many long repeat sequences, suggesting that these repeats play a role in facilitating rearrangements. Molecular evolution analyses indicate that most genes are under purifying selection.
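As a rough illustration of the arithmetic behind these figures, the minimal Python sketch below tallies the reported gene counts and shows how a quadripartite genome length depends on its regions. The function names are hypothetical, and the region lengths in the second part are toy values chosen for illustration only; they are not measurements from this study.

```python
# Gene counts as reported in the abstract.
REPORTED_UNIQUE_GENES = {"protein_coding": 73, "tRNA": 29, "rRNA": 4}


def total_unique_genes(counts):
    """Sum the per-category gene counts."""
    return sum(counts.values())


def quadripartite_length(lsc_bp, ssc_bp, ir_bp):
    """Length of a circular quadripartite genome: LSC + SSC + 2 * IR."""
    return lsc_bp + ssc_bp + 2 * ir_bp


# The three categories add up to the 106 unique genes stated in the abstract.
print(total_unique_genes(REPORTED_UNIQUE_GENES))  # 106

# Toy values (NOT from the paper): if each IR copy expands by 5 bp into the
# small single-copy region, the SSC shrinks by 5 bp but both IR copies grow,
# so the genome as a whole becomes 5 bp longer.
before = quadripartite_length(lsc_bp=60, ssc_bp=20, ir_bp=10)
after = quadripartite_length(lsc_bp=60, ssc_bp=15, ir_bp=15)
print(before, after)  # 100 105
```

Because the IR is present in two copies, any sequence it absorbs from a single-copy region is counted twice, which is one way IR expansion or contraction can shift total genome size.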

Keywords: St. John’s wort; codon usage; intron loss; phylogeny; rearrangement; substitution rate.

MeSH terms

  • Base Sequence
  • Clusiaceae* / genetics
  • Evolution, Molecular
  • Genome, Chloroplast*
  • Hypericum* / genetics
  • Phylogeny
  • Repetitive Sequences, Nucleic Acid