Complete chloroplast genome of the multifunctional crop globe artichoke and comparison with other Asteraceae

PLoS One. 2015 Mar 16;10(3):e0120589. doi: 10.1371/journal.pone.0120589. eCollection 2015.

Abstract

With over 20,000 species, Asteraceae is the second largest plant family. High-throughput sequencing of nuclear and chloroplast genomes has allowed for a better understanding of the evolutionary relationships within large plant families. Here, the globe artichoke chloroplast (cp) genome was obtained by a combination of whole-genome and BAC clone high-throughput sequencing. The artichoke cp genome is 152,529 bp in length, consisting of two single-copy regions separated by a pair of inverted repeats (IRs) of 25,155 bp each, the longest IRs found in the Asteraceae family so far. The large (LSC) and the small (SSC) single-copy regions span 83,578 bp and 18,641 bp, respectively. The artichoke cp sequence was compared with the eight other complete Asteraceae cp genomes available, revealing an IR expansion at the SSC/IR boundary. This expansion comprises 17 bp of the ndhF gene, generating an overlap between the ndhF and ycf1 genes. A total of 127 cp simple sequence repeats (cpSSRs) were identified in the artichoke cp genome, potentially suitable for future population studies in the Cynara genus. Parsimony-informative regions were evaluated and allowed a Cynara species to be placed within the Asteraceae family tree. The eight most informative coding regions were also tested for use as "specific barcodes" in the Asteraceae family. Our results highlight the usefulness of cp genome sequencing in exploring plant genome diversity and retrieving reliable molecular resources for phylogenetic and evolutionary studies, as well as for specific barcodes in plants.
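The region lengths reported above can be cross-checked with simple arithmetic, since a quadripartite cp genome is the sum of the LSC, the SSC, and two IR copies. The following is a minimal sketch of that check, not part of the authors' pipeline; the function names `quadripartite_length` and `region_fractions` are hypothetical and introduced only for illustration.

```python
# Region lengths in bp, taken from the abstract.
LSC = 83_578      # large single-copy region
SSC = 18_641      # small single-copy region
IR = 25_155       # length of ONE inverted-repeat copy
TOTAL = 152_529   # reported genome length


def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total length of a quadripartite cp genome: LSC + SSC + two IR copies."""
    return lsc + ssc + 2 * ir


def region_fractions(lsc: int, ssc: int, ir: int) -> dict:
    """Share of the genome occupied by each region (both IR copies pooled)."""
    total = quadripartite_length(lsc, ssc, ir)
    return {"LSC": lsc / total, "SSC": ssc / total, "IRs": 2 * ir / total}


computed = quadripartite_length(LSC, SSC, IR)
print(computed, computed == TOTAL)  # 152529 True

for region, share in region_fractions(LSC, SSC, IR).items():
    print(f"{region}: {share:.1%}")
```

The sum matches the reported 152,529 bp, so the four region lengths in the abstract are internally consistent.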

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Asteraceae / classification
  • Asteraceae / genetics*
  • Computational Biology
  • Cynara scolymus / classification
  • Cynara scolymus / genetics*
  • DNA Barcoding, Taxonomic
  • Evolution, Molecular
  • Exons
  • Gene Order
  • Genes, Plant
  • Genetic Structures
  • Genome, Chloroplast*
  • Genomics*
  • High-Throughput Nucleotide Sequencing
  • Introns
  • Molecular Sequence Annotation
  • Molecular Sequence Data
  • Phylogeny

Associated data

  • GENBANK/KM035764
  • SRA/SRP049578
  • SRA/SRR1648410
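The GenBank accession listed above can be retrieved programmatically. The sketch below only builds a request URL for the NCBI E-utilities `efetch` endpoint and does not contact the network; the helper name `genbank_efetch_url` is hypothetical, and it assumes the accession resolves in the `nuccore` database.

```python
from urllib.parse import urlencode

# NCBI E-utilities efetch endpoint.
EFETCH_BASE = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def genbank_efetch_url(accession: str) -> str:
    """Build an efetch URL for a nucleotide record in GenBank flat-file format.

    Assumption: the accession is a nucleotide record served from 'nuccore'.
    """
    params = {
        "db": "nuccore",     # nucleotide database
        "id": accession,     # e.g. the accession listed under Associated data
        "rettype": "gb",     # GenBank flat-file format
        "retmode": "text",
    }
    return f"{EFETCH_BASE}?{urlencode(params)}"


url = genbank_efetch_url("KM035764")
print(url)
# To actually download the record, one could pass this URL to
# urllib.request.urlopen(url); that step is left out here to keep
# the example free of network access.
```

The SRA entries (SRP049578, SRR1648410) are read archives rather than nucleotide records and are normally fetched with dedicated tools, so they are not covered by this sketch.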

Grants and funding

This research was supported by a dedicated grant from the Italian Ministry of Economy and Finance to the National Research Council for the project Innovazione e Sviluppo del Mezzogiorno – Conoscenze Integrate per Sostenibilità ed Innovazione del Made in Italy Agroalimentare (C.I.S.I.A.) – Legge n. 191/2009, and by the project BiodiverSO – PSR Puglia 2007-2013 Mis. 214/4 subaz. a), http://biodiversitapuglia.it/. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.