Complete chloroplast genome sequence of the medicinal plant Arctium lappa

Genome. 2020 Jan;63(1):53-60. doi: 10.1139/gen-2019-0070. Epub 2019 Oct 3.

Abstract

Arctium lappa, commonly called burdock, has a long medicinal and edible history. It has recently gained increasing attention because of its economic value. In this study, we obtained the complete chloroplast genome of A. lappa by Illumina Hiseq. The complete chloroplast genome of A. lappa is a typical circular structure with 152 708 bp in length. The GC content in the whole chloroplast genome of A. lappa is 37.7%. A total of 37 tRNA genes, 8 rRNA genes, and 87 protein-coding genes were successfully annotated. And the chloroplast genome contains 113 unique genes, 19 of which are duplicated in the inverted repeat. The distribution of 39 simple sequence repeats was analysed, and most of them are in the large single-copy (LSC) sequence. An inversion comprising 16 genes was found in the LSC region, which is 26 283 bp long. We performed multiple sequence alignments using 72 common protein-coding genes of 29 species and constructed a Maximum Parsimony (MP) tree. The MP phylogenetic result shows that A. lappa grouped together with Carthamus tinctorius, Centaurea diffusa, and Saussurea involucrata. The chloroplast genome of A. lappa is a valuable resource for further studies in Asteraceae.

Keywords: Arctium lappa; chloroplast genome; génome chloroplastique; molecular structure; phylogenetic relationship; relation phylogénétique; structure moléculaire.

MeSH terms

  • Arctium / classification
  • Arctium / genetics*
  • Codon Usage
  • DNA, Plant / chemistry
  • Genes, Plant
  • Genome, Chloroplast*
  • Inverted Repeat Sequences
  • Microsatellite Repeats
  • Phylogeny
  • Plants, Medicinal / genetics
  • Repetitive Sequences, Nucleic Acid

Substances

  • DNA, Plant