Complete chloroplast genome sequence of poisonous and medicinal plant Datura stramonium: organizations and implications for genetic engineering

PLoS One. 2014 Nov 3;9(11):e110656. doi: 10.1371/journal.pone.0110656. eCollection 2014.

Abstract

Datura stramonium is a widely used poisonous plant with great medicinal and economic value. Its chloroplast (cp) genome is 155,871 bp in length with a typical quadripartite structure of the large (LSC, 86,302 bp) and small (SSC, 18,367 bp) single-copy regions, separated by a pair of inverted repeats (IRs, 25,601 bp). The genome contains 113 unique genes, including 80 protein-coding genes, 29 tRNAs and four rRNAs. A total of 11 forward, 9 palindromic and 13 tandem repeats were detected in the D. stramonium cp genome. Most simple sequence repeats (SSR) are AT-rich and are less abundant in coding regions than in non-coding regions. Both SSRs and GC content were unevenly distributed in the entire cp genome. All preferred synonymous codons were found to use A/T ending codons. The difference in GC contents of entire genomes and of the three-codon positions suggests that the D. stramonium cp genome might possess different genomic organization, in part due to different mutational pressures. The five most divergent coding regions and four non-coding regions (trnH-psbA, rps4-trnS, ndhD-ccsA, and ndhI-ndhG) were identified using whole plastome alignment, which can be used to develop molecular markers for phylogenetics and barcoding studies within the Solanaceae. Phylogenetic analysis based on 68 protein-coding genes supported Datura as a sister to Solanum. This study provides valuable information for phylogenetic and cp genetic engineering studies of this poisonous and medicinal plant.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Composition
  • Codon
  • Computational Biology
  • Datura stramonium / classification
  • Datura stramonium / genetics*
  • Genetic Engineering
  • Genome, Chloroplast*
  • Genomics
  • Microsatellite Repeats
  • Molecular Sequence Annotation
  • Phylogeny
  • Plants, Medicinal / classification
  • Plants, Medicinal / genetics*
  • Plants, Toxic / classification
  • Plants, Toxic / genetics*
  • Repetitive Sequences, Nucleic Acid
  • Sequence Analysis, DNA

Substances

  • Codon

Grants and funding

This work was supported by Macau Science and Technology Development Fund (http://www.fdct.gov.mo/) (077/2011/A3, 074/2012/A3), Research committee of University of Macau (http://www.umac.mo/research/research_committee.html) (MYRG208A (Y3-L4)-ICMS11-WYT, MRG012/WYT/2013/ICMS, MRG013/WYT/2013/ICMS) and the National Natural Science Foundation of China (81202860, 81303160). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.