A method to generate multilocus barcodes of pinned insect specimens using MiSeq

Mol Ecol Resour. 2020 May;20(3). doi: 10.1111/1755-0998.13143. Epub 2020 Mar 6.

Abstract

For molecular insect identification, amplicon sequencing methods are recommended because they offer a cost-effective approach for targeting small sets of informative genes from multiple samples. In this context, high-throughput multilocus amplicon sequencing has been achieved using the MiSeq Illumina sequencing platform. However, this approach generates short gene fragments of <500 bp, which then have to be overlapped bioinformatically to achieve longer sequence lengths. This increases the risk of generating chimeric sequences or of recovering incomplete loci. Here, we propose a modified nested amplicon sequencing method for targeting multiple loci from pinned insect specimens using the MiSeq Illumina platform. The modification consists of a three-step nested PCR approach that targets near full-length loci in the initial PCR and subsequently amplifies short fragments of between 300 and 350 bp for high-throughput sequencing using Illumina chemistry. Using this method, we generated 407 sequences of three loci from 86% of all the specimens sequenced. Out of 103 pinned bee specimens of replicated species, 71% passed the 95% sequence similarity threshold between species replicates. This method worked best for pinned specimens aged between 0 and 5 years, with a limit of 10 years for pinned and 14 years for ethanol-preserved specimens. Hence, our method overcomes some of the challenges of amplicon sequencing using short-read next-generation sequencing and improves the possibility of creating high-quality multilocus barcodes from insect collections.
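
The 95% sequence similarity threshold between species replicates can be illustrated with a minimal sketch. The abstract does not state how similarity was computed, so everything below is an assumption for illustration: the function names (percent_identity, passes_threshold) are hypothetical, the two sequences are assumed to be already aligned to equal length, and columns containing a gap or an ambiguous base (N) are simply skipped.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent identity between two pre-aligned sequences.

    Assumption: columns where either sequence has a gap ('-') or an
    ambiguous base ('N') are excluded from the comparison.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to equal length")

    compared = 0
    matches = 0
    for base_a, base_b in zip(seq_a.upper(), seq_b.upper()):
        if base_a in "-N" or base_b in "-N":
            continue  # skip gaps and ambiguous bases
        compared += 1
        if base_a == base_b:
            matches += 1

    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def passes_threshold(seq_a: str, seq_b: str, threshold: float = 95.0) -> bool:
    """True if two replicate sequences meet the similarity threshold."""
    return percent_identity(seq_a, seq_b) >= threshold
```

Under these assumptions, two replicates of the same species at one locus would be compared with passes_threshold(rep1, rep2), and a specimen pair counts as passing when the returned value is True. A real pipeline would first align the sequences with a dedicated alignment tool and might treat gaps differently.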

Keywords: MiSeq; insect collections; multilocus barcoding; pinned specimens.

MeSH terms

  • Animals
  • Bees / genetics*
  • Computational Biology / methods
  • DNA Barcoding, Taxonomic / methods*
  • High-Throughput Nucleotide Sequencing / methods*
  • Polymerase Chain Reaction / methods
  • Sequence Analysis, DNA / methods

Associated data

  • GENBANK/MH320097
  • GENBANK/MH320504