Highly comparable metabarcoding results from MGI-Tech and Illumina sequencing platforms

PeerJ. 2021 Sep 30:9:e12254. doi: 10.7717/peerj.12254. eCollection 2021.

Abstract

With the developments in DNA nanoball sequencing technologies and the emergence of new platforms, there is an increasing interest in their performance in comparison with the widely used sequencing-by-synthesis methods. Here, we test the consistency of metabarcoding results from DNBSEQ-G400RS (DNA nanoball sequencing platform by MGI-Tech) and NovaSeq 6000 (sequencing-by-synthesis platform by Illumina) platforms using technical replicates of DNA libraries that consist of COI gene amplicons from 120 soil DNA samples. By subjecting raw sequencing data from both platforms to a uniform bioinformatics processing, we found that the proportion of high-quality reads passing through the filtering steps was similar in both datasets. Per-sample operational taxonomic unit (OTU) and amplicon sequence variant (ASV) richness patterns were highly correlated, but sequencing data from DNBSEQ-G400RS harbored a higher number of OTUs. This may be related to the lower dominance of most common OTUs in DNBSEQ data set (thus revealing higher richness by detecting rare taxa) and/or to a lower effective read quality leading to generation of spurious OTUs. However, there was no statistical difference in the ASV and post-clustered ASV richness between platforms, suggesting that additional denoising step in the ASV workflow had effectively removed the 'noisy' reads. Both OTU-based and ASV-based composition were strongly correlated between the sequencing platforms, with essentially interchangeable results. Therefore, we conclude that DNBSEQ-G400RS and NovaSeq 6000 are both equally efficient high-throughput sequencing platforms to be utilized in studies aiming to apply the metabarcoding approach, but the main benefit of the former is related to lower sequencing cost.

Keywords: COI; DNBSEQ; Illumina; MGI-Tech; Metabarcoding; NovaSeq.

Grants and funding

Funding for this study was provided by the Novo Nordisk Foundation grant NNF20OC0059948 (Silva Nova), EEA Financial Mechanism Baltic Research Programme in Estonia (SUCC, EMP442) and the Estonian Research Council (Mobilitas Pluss grant MOBTP198). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.