APSCALE: advanced pipeline for simple yet comprehensive analyses of DNA metabarcoding data

Dominik Buchner; Till-Hendrik Macher; Florian Leese

doi:10.1093/bioinformatics/btac588

APSCALE: advanced pipeline for simple yet comprehensive analyses of DNA metabarcoding data

Bioinformatics. 2022 Oct 14;38(20):4817-4819. doi: 10.1093/bioinformatics/btac588.

Authors

Dominik Buchner¹, Till-Hendrik Macher¹, Florian Leese^{1

2}

Affiliations

¹ University of Duisburg-Essen, Faculty of Biology, Aquatic Ecosystem Research, Essen 45141, Germany.
² Univeresity of Duisburg-Essen, Centre for Water and Environmental Research (ZWU), Essen 45141, Germany.

Abstract

Summary: DNA metabarcoding is an emerging approach to assess and monitor biodiversity worldwide and consequently the number and size of data sets increases exponentially. To date, no published DNA metabarcoding data processing pipeline exists that is (i) platform independent, (ii) easy to use [incl. graphical user interface (GUI)], (iii) fast (does scale well with dataset size) and (iv) complies with data protection regulations of e.g. environmental agencies. The presented pipeline APSCALE meets these requirements and handles the most common tasks of sequence data processing, such as paired-end merging, primer trimming, quality filtering, clustering and denoising of any popular metabarcoding marker, such as internal transcribed spacer, 16S or cytochrome c oxidase subunit I. APSCALE comes in a command line and a GUI version. The latter provides the user with additional summary statistics options and links to GUI-based downstream applications.

Availability and implementation: APSCALE is written in Python, a platform-independent language, and integrates functions of the open-source tools, VSEARCH (Rognes et al., 2016), cutadapt (Martin, 2011) and LULU (Frøslev et al., 2017). All modules support multithreading to allow fast processing of larger DNA metabarcoding datasets. Further information and troubleshooting are provided on the respective GitHub pages for the command-line version (https://github.com/DominikBuchner/apscale) and the GUI-based version (https://github.com/TillMacher/apscale_gui), including a detailed tutorial.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

DNA Barcoding, Taxonomic*
Electron Transport Complex IV
Software*

Substances

Electron Transport Complex IV