Simple and flexible sign and rank-based methods for testing for differential abundance in microbiome studies

Leyla Kodalci; Olivier Thas

doi:10.1371/journal.pone.0292055

Simple and flexible sign and rank-based methods for testing for differential abundance in microbiome studies

PLoS One. 2023 Sep 26;18(9):e0292055. doi: 10.1371/journal.pone.0292055. eCollection 2023.

Authors

Leyla Kodalci¹, Olivier Thas^{1

2

3}

Affiliations

¹ Data Science Institute and I-BioStat, Hasselt University, Diepenbeek, Belgium.
² Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Gent, Belgium.
³ National Institute for Applied Statistics Research Australia (NIASRA), University of Wollongong, Wollongong, New South Wales, Australia.

Abstract

Microbiome data obtained with amplicon sequencing are considered as compositional data. It has been argued that these data can be analysed after appropriate transformation to log-ratios, but ratios and logarithms cause problems with the many zeroes in typical microbiome experiments. We demonstrate that some well chosen sign and rank transformations also allow for valid inference with compositional data, and we show how logistic regression and probabilistic index models can be used for testing for differential abundance, while inheriting the flexibility of a statistical modelling framework. The results of a simulation study demonstrate that the new methods perform better than most other methods, and that it is comparable with ANCOM-BC. These methods are implemented in an R-package 'signtrans' and can be installed from Github (https://github.com/lucp9827/signtrans).

Copyright: © 2023 Kodalci, Thas. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Computer Simulation
Microbiota* / genetics
Models, Statistical

Grants and funding

This work was supported by the Special Research Fund (BOF) of Hasselt University [BOF21GP17]. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.