How effective are DNA barcodes in the identification of African rainforest trees?

PLoS One. 2013;8(4):e54921. doi: 10.1371/journal.pone.0054921. Epub 2013 Apr 2.

Abstract

Background: DNA barcoding of rain forest trees could potentially help biologists identify species and discover new ones. However, DNA barcodes cannot always distinguish between closely related species, and the size and completeness of barcode databases are key parameters for their successful application. We test the ability of rbcL, matK and trnH-psbA plastid DNA markers to identify rain forest trees at two sites in Atlantic central Africa under the assumption that a database is exhaustive in terms of species content, but not necessarily in terms of haplotype diversity within species.

Methodology/principal findings: We assess the accuracy of identification to species or genus using a genetic distance matrix between samples either based on a global multiple sequence alignment (GD) or on a basic local alignment search tool (BLAST). Where a local database is available (within a 50 ha plot), barcoding was generally reliable for genus identification (95-100% success), but less for species identification (71-88%). Using a single marker, best results for species identification were obtained with trnH-psbA. There was a significant decrease of barcoding success in species-rich clades. When the local database was used to identify the genus of trees from another region and did include all genera from the query individuals but not all species, genus identification success decreased to 84-90%. The GD method performed best but a global multiple sequence alignment is not applicable on trnH-psbA.

Conclusions/significance: Barcoding is a useful tool to assign unidentified African rain forest trees to a genus, but identification to a species is less reliable, especially in species-rich clades, even using an exhaustive local database. Combining two markers improves the accuracy of species identification but it would only marginally improve genus identification. Finally, we highlight some limitations of the BLAST algorithm as currently implemented and suggest possible improvements for barcoding applications.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cameroon
  • DNA Barcoding, Taxonomic*
  • Databases, Genetic
  • Genes, Plant
  • Plastids / genetics
  • Polymorphism, Genetic
  • Species Specificity
  • Trees / classification*
  • Trees / genetics*
  • Tropical Climate

Grants and funding

DNA extraction and sequencing were financed by the Centre National de Séquençage (Genoscope, France, http://www.genoscope.cns.fr/spip/), the Government of Canada through Genome Canada and the Ontario Genomics Institute (2008-OGI-ICI-03), and by the Belgian Fund for Scientific Research (Fonds de la Recherche Scientifique-FNRS, http://www1.frs-fnrs.be/) through grant MIS 4.519.10. IP and JD (postdoctoral researchers), and OJH (research associate) are financed by the FRS-FNRS. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.