Simple and flexible sign and rank-based methods for testing for differential abundance in microbiome studies

PLoS One. 2023 Sep 26;18(9):e0292055. doi: 10.1371/journal.pone.0292055. eCollection 2023.

Abstract

Microbiome data obtained with amplicon sequencing are considered as compositional data. It has been argued that these data can be analysed after appropriate transformation to log-ratios, but ratios and logarithms cause problems with the many zeroes in typical microbiome experiments. We demonstrate that some well chosen sign and rank transformations also allow for valid inference with compositional data, and we show how logistic regression and probabilistic index models can be used for testing for differential abundance, while inheriting the flexibility of a statistical modelling framework. The results of a simulation study demonstrate that the new methods perform better than most other methods, and that it is comparable with ANCOM-BC. These methods are implemented in an R-package 'signtrans' and can be installed from Github (https://github.com/lucp9827/signtrans).

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computer Simulation
  • Microbiota* / genetics
  • Models, Statistical

Grants and funding

This work was supported by the Special Research Fund (BOF) of Hasselt University [BOF21GP17]. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.