The hemiptera (insecta) of Canada: constructing a reference library of DNA barcodes

PLoS One. 2015 Apr 29;10(4):e0125635. doi: 10.1371/journal.pone.0125635. eCollection 2015.

Abstract

DNA barcode reference libraries linked to voucher specimens create new opportunities for high-throughput identification and taxonomic re-evaluations. This study provides a DNA barcode library for about 45% of the recognized species of Canadian Hemiptera, and the publically available R workflow used for its generation. The current library is based on the analysis of 20,851 specimens including 1849 species belonging to 628 genera and 64 families. These individuals were assigned to 1867 Barcode Index Numbers (BINs), sequence clusters that often coincide with species recognized through prior taxonomy. Museum collections were a key source for identified specimens, but we also employed high-throughput collection methods that generated large numbers of unidentified specimens. Many of these specimens represented novel BINs that were subsequently identified by taxonomists, adding barcode coverage for additional species. Our analyses based on both approaches includes 94 species not listed in the most recent Canadian checklist, representing a potential 3% increase in the fauna. We discuss the development of our workflow in the context of prior DNA barcode library construction projects, emphasizing the importance of delineating a set of reference specimens to aid investigations in cases of nomenclatural and DNA barcode discordance. The identification for each specimen in the reference set can be annotated on the Barcode of Life Data System (BOLD), allowing experts to highlight questionable identifications; annotations can be added by any registered user of BOLD, and instructions for this are provided.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Canada
  • DNA Barcoding, Taxonomic*
  • Gene Library*
  • Hemiptera / classification
  • Hemiptera / genetics*
  • Species Specificity

Grants and funding

This research was enabled by funding from NSERC, Genome Canada through the Ontario Genomics Institute, and by Agriculture and Agri-Food Canada. The funders had no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.