A Measure of the DNA Barcode Gap for Applied and Basic Research

Methods Mol Biol. 2024:2744:375-390. doi: 10.1007/978-1-0716-3581-0_24.

Abstract

DNA barcoding has largely established itself as a mainstay for rapid molecular taxonomic identification in both academic and applied research. The use of DNA barcoding as a molecular identification method depends on a "DNA barcode gap"-the separation between the maximum within-species difference and the minimum between-species difference. Previous work indicates the presence of a gap hinges on sampling effort for focal taxa and their close relatives. Furthermore, both theory and empirical work indicate a gap may not occur for related pairs of biological species. Here, we present a novel evaluation approach in the form of an easily calculated set of nonparametric metrics to quantify the extent of proportional overlap in inter- and intraspecific distributions of pairwise differences among target species and their conspecifics. The metrics are based on a simple count of the number of overlapping records for a species falling within the bounds of maximum intraspecific distance and minimum interspecific distance. Our approach takes advantage of the asymmetric directionality inherent in pairwise genetic distance distributions, which has not been previously done in the DNA barcoding literature. We apply the metrics to the predatory diving beetle genus Agabus as a case study because this group poses significant identification challenges due to its morphological uniformity despite both relative sampling ease and well-established taxonomy. Results herein show that target species and their nearest neighbor species were found to be tightly clustered and therefore difficult to distinguish. Such findings demonstrate that DNA barcoding can fail to fully resolve species in certain cases. Moving forward, we suggest the implementation of the proposed metrics be integrated into a common framework to be reported in any study that uses DNA barcoding for identification. In so doing, the importance of the DNA barcode gap and its components for the success of DNA-based identification using DNA barcodes can be better appreciated.

Keywords: Bootstrapping; DNA barcoding; Interspecific genetic distance; Intraspecific genetic distance; Multispecies coalescent; Nonparametrics; Speciation.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Coleoptera / classification
  • Coleoptera / genetics
  • DNA / analysis
  • DNA / genetics
  • DNA Barcoding, Taxonomic* / methods
  • Species Specificity

Substances

  • DNA