Plastid genome sequence of a wild woody oil species, Prinsepia utilis, provides insights into evolutionary and mutational patterns of Rosaceae chloroplast genomes

PLoS One. 2013 Sep 2;8(9):e73946. doi: 10.1371/journal.pone.0073946. eCollection 2013.

Abstract

Background: Prinsepia utilis Royle is a wild woody oil species of Rosaceae that yields an edible oil shown to have particular benefits for human health and medical therapy. However, the lack of bred varieties has largely impeded exploitation of the species' considerable potential for high-quality seed oil. There is an urgent need to expand knowledge of the genetic basis of the species and to develop genetic markers to support modern breeding programs.

Results: Here we report the complete chloroplast (cp) genome of P. utilis, which is 156,328 bp in length. Comparative cp sequence analyses of P. utilis and four other Rosaceae species revealed similar genome structures, gene orders, and gene contents. Contraction and expansion of the inverted repeat regions (IRs) explained part of the length variation among the Rosaceae cp genomes. Genome sequence alignments revealed that nucleotide diversity was associated with AT content, and that the large single copy (LSC) and small single copy (SSC) regions harbored higher sequence variation in both coding and non-coding regions than the IRs. Simple sequence repeats (SSRs) were detected in the P. utilis cp genome and compared with those of the other four Rosaceae cp genomes. Almost all SSR loci were composed of A or T; they may therefore contribute to the AT richness of cp genomes and be associated with AT-biased sequence variation. Among all protein-coding genes, ycf1 showed the highest sequence divergence, indicating that it could discriminate species within Rosaceae, as well as within angiosperms, better than other genes.
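The two quantities the Results relate to each other, AT content and A/T-composed SSR loci, can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names, the restriction to mononucleotide repeats, the minimum run length of 8, and the toy sequence are all assumptions made for illustration, since the abstract does not state the detection tool or thresholds used.

```python
import re


def at_content(seq: str) -> float:
    """Return the fraction of A/T among unambiguous bases (A, C, G, T)."""
    seq = seq.upper()
    total = sum(seq.count(base) for base in "ACGT")
    if total == 0:
        return 0.0
    return (seq.count("A") + seq.count("T")) / total


def find_mono_ssrs(seq: str, min_repeats: int = 8):
    """Return (start, base, length) for each mononucleotide run.

    min_repeats is a hypothetical threshold chosen for this sketch;
    the abstract does not report the cutoff used in the study.
    """
    seq = seq.upper()
    # One base followed by at least (min_repeats - 1) copies of itself.
    pattern = re.compile(r"([ACGT])\1{%d,}" % (min_repeats - 1))
    return [(m.start(), m.group(1), len(m.group(0)))
            for m in pattern.finditer(seq)]


# Toy sequence (not real data): two A/T runs and a short G run.
toy = "GC" + "A" * 10 + "CG" + "T" * 8 + "GGGG"
ssrs = find_mono_ssrs(toy)
at_ssrs = [s for s in ssrs if s[1] in "AT"]
```

Applied to a real cp genome (for example, the sequence deposited under the GenBank accession listed under Associated data), the share of `at_ssrs` among all `ssrs` would give a rough counterpart to the observation that almost all SSR loci consist of A or T.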

Conclusions: With the addition of this newly sequenced cp genome, a high nucleotide substitution rate and abundant deletions/insertions were observed, suggesting greater genomic dynamics than previously recognized in Rosaceae. The availability of the complete cp genome of P. utilis will provide chloroplast markers and genetic information to support the conservation and utilization of this woody oil plant.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Chloroplasts / genetics*
  • Evolution, Molecular*
  • Genetic Markers / genetics
  • Genome, Plant / genetics*
  • Genomics
  • Microsatellite Repeats / genetics
  • Molecular Sequence Data
  • Mutation*
  • Phylogeny
  • Rosaceae / genetics*

Substances

  • Genetic Markers

Associated data

  • GENBANK/KC571835

Grants and funding

This work was supported by Key Project of Natural Science Foundation of Yunnan Province (2010CC011), Top Talents Program of Yunnan Province (20080A009), Hundreds Oversea Talents Program of Yunnan Province, Hundreds Talents Program of Chinese Academy of Sciences (CAS), a grant from the CAS (KSCX2-YW-N-029), a grant from Chinese Department of Science and Technology (973 Program 2007CB815703) and a startup grant of Kunming Institute of Botany, CAS (to LZG). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.