Skmer: assembly-free and alignment-free sample identification using genome skims

Genome Biol. 2019 Feb 13;20(1):34. doi: 10.1186/s13059-019-1632-4.

Abstract

The ability to inexpensively describe taxonomic diversity is critical in this era of rapid climate and biodiversity changes. The recent genome-skimming approach extends current barcoding practices beyond short markers by applying low-pass sequencing and recovering whole organelle genomes computationally. This approach discards the nuclear DNA, which constitutes the vast majority of the data. In contrast, we suggest using all unassembled reads. We introduce an assembly-free and alignment-free tool, Skmer, to compute genomic distances between the query and reference genome skims. Skmer shows excellent accuracy in estimating distances and identifying the closest match in reference datasets.

Keywords: Alignment-free; Assembly-free; DNA Barcoding; DNA reference database; Genome skimming; Second generation sequencing.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Birds / genetics
  • DNA Barcoding, Taxonomic / methods*
  • Genome, Insect*
  • Genomics / methods*
  • Models, Genetic*
  • Phylogeny