An expanded mammal mitogenome dataset from Southeast Asia

Gigascience. 2017 Aug 1;6(8):1-8. doi: 10.1093/gigascience/gix053.

Abstract

Southeast (SE) Asia is 1 of the most biodiverse regions in the world, and it holds approximately 20% of all mammal species. Despite this, the majority of SE Asia's genetic diversity is still poorly characterized. The growing interest in using environmental DNA to assess and monitor SE Asian species, in particular threatened mammals-has created the urgent need to expand the available reference database of mitochondrial barcode and complete mitogenome sequences. We have partially addressed this need by generating 72 new mitogenome sequences reconstructed from DNA isolated from a range of historical and modern tissue samples. Approximately 55 gigabases of raw sequence were generated. From this data, we assembled 72 complete mitogenome sequences, with an average depth of coverage of ×102.9 and ×55.2 for modern samples and historical samples, respectively. This dataset represents 52 species, of which 30 species had no previous mitogenome data available. The mitogenomes were geotagged to their sampling location, where known, to display a detailed geographical distribution of the species. Our new database of 52 taxa will strongly enhance the utility of environmental DNA approaches for monitoring mammals in SE Asia as it greatly increases the likelihoods that identification of metabarcoding sequencing reads can be assigned to reference sequences. This magnifies the confidence in species detections and thus allows more robust surveys and monitoring programmes of SE Asia's threatened mammal biodiversity. The extensive collections of historical samples from SE Asia in western and SE Asian museums should serve as additional valuable material to further enrich this reference database.

Keywords: GenBank; invertebrate-derived (iDNA); metabarcoding; taxonomic assignment.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Asia, Southeastern
  • Biodiversity
  • Computational Biology / methods
  • DNA Barcoding, Taxonomic
  • Databases, Nucleic Acid*
  • Genetic Variation
  • Genome, Mitochondrial*
  • Genomics / methods
  • Mammals / genetics*
  • Molecular Sequence Annotation
  • Phylogeny
  • Phylogeography
  • Reproducibility of Results