Systematic error in seed plant phylogenomics

Genome Biol Evol. 2011;3:1340-8. doi: 10.1093/gbe/evr105. Epub 2011 Oct 19.

Abstract

Resolving the closest relatives of Gnetales has been an enigmatic problem in seed plant phylogeny. The problem is known to be difficult because of the extent of divergence between this diverse group of gymnosperms and their closest phylogenetic relatives. Here, we investigate the evolutionary properties of conifer chloroplast DNA sequences. To improve taxon sampling of Cupressophyta (non-Pinaceae conifers), we report sequences from three new chloroplast (cp) genomes of Southern Hemisphere conifers. We have applied a site pattern sorting criterion to study compositional heterogeneity, heterotachy, and the fit of conifer chloroplast genome sequences to a general time reversible + G substitution model. We show that non-time reversible properties of aligned sequence positions in the chloroplast genomes of Gnetales mislead phylogenetic reconstruction of these seed plants. When 2,250 of the most varied sites in our concatenated alignment are excluded, phylogenetic analyses favor a close evolutionary relationship between the Gnetales and Pinaceae (the Gnepine hypothesis). Our analytical protocol provides a useful approach for evaluating the robustness of phylogenomic inferences. Our findings highlight the importance of goodness of fit between substitution model and data for understanding seed plant phylogeny.
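The general idea of ranking alignment columns by variability and excluding the most varied ones can be illustrated with a minimal sketch. It is not the authors' protocol: the abstract does not specify the sorting criterion, so per-column Shannon entropy is used here purely as an assumed stand-in, and the function names, taxon labels, and toy sequences are hypothetical.

```python
import math
from collections import Counter


def column_entropy(column):
    """Shannon entropy (bits) of one alignment column; gaps and ambiguity codes are ignored."""
    counts = Counter(c for c in column.upper() if c in "ACGT")
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def drop_most_variable_sites(alignment, n_drop):
    """Remove the n_drop highest-entropy columns (assumed variability score).

    alignment: dict mapping taxon name -> aligned sequence (equal lengths).
    Returns (filtered alignment, indices of the columns that were kept).
    """
    names = list(alignment)
    length = len(alignment[names[0]])
    if any(len(seq) != length for seq in alignment.values()):
        raise ValueError("sequences must be aligned to equal length")
    columns = ["".join(alignment[name][i] for name in names) for i in range(length)]
    scores = [column_entropy(col) for col in columns]
    # Rank columns from most to least variable; the stable sort breaks ties by position.
    ranked = sorted(range(length), key=lambda i: -scores[i])
    dropped = set(ranked[:n_drop])
    kept = [i for i in range(length) if i not in dropped]
    filtered = {name: "".join(alignment[name][i] for i in kept) for name in names}
    return filtered, kept


# Hypothetical toy alignment: only columns 2 and 3 vary.
toy = {
    "taxonA": "ACGTAC",
    "taxonB": "ACGAAC",
    "taxonC": "ACCCAC",
    "taxonD": "ACTGAC",
}
filtered, kept = drop_most_variable_sites(toy, 2)
```

In a robustness check of this kind, the tree would be re-inferred on the filtered alignment for increasing values of `n_drop`, to see whether the favored topology changes as the most varied sites are removed.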

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • DNA, Chloroplast / genetics
  • Genome, Chloroplast*
  • Gnetophyta / classification*
  • Gnetophyta / genetics
  • Models, Genetic
  • Phylogeny*
  • Seeds / genetics*
  • Tracheophyta / classification*
  • Tracheophyta / genetics

Substances

  • DNA, Chloroplast