Evaluation of nine popular de novo assemblers in microbial genome assembly

J Microbiol Methods. 2017 Dec:143:32-37. doi: 10.1016/j.mimet.2017.09.008. Epub 2017 Sep 19.

Abstract

Next generation sequencing (NGS) technologies are revolutionizing biology, with Illumina being the most popular NGS platform. Short read assembly is a critical part of most genome studies using NGS. Hence, in this study, the performance of nine well-known assemblers was evaluated in the assembly of seven different microbial genomes. Effect of different read coverage and k-mer parameters on the quality of the assembly were also evaluated on both simulated and actual read datasets. Our results show that the performance of assemblers on real and simulated datasets could be significantly different, mainly because of coverage bias. According to outputs on actual read datasets, for all studied read coverages (of 7×, 25× and 100×), SPAdes and IDBA-UD clearly outperformed other assemblers based on NGA50 and accuracy metrics. Velvet is the most conservative assembler with the lowest NGA50 and error rate.

Keywords: Assembly; Genome; Illumina; N50; NGS.

Publication types

  • Comparative Study
  • Evaluation Study

MeSH terms

  • Computational Biology / methods*
  • Genome, Microbial*
  • High-Throughput Nucleotide Sequencing / methods
  • Sequence Analysis, DNA / methods*