LinDA: linear models for differential abundance analysis of microbiome compositional data

Huijuan Zhou; Kejun He; Jun Chen; Xianyang Zhang

doi:10.1186/s13059-022-02655-5

LinDA: linear models for differential abundance analysis of microbiome compositional data

Genome Biol. 2022 Apr 14;23(1):95. doi: 10.1186/s13059-022-02655-5.

Authors

Huijuan Zhou^{1

2

3}, Kejun He³, Jun Chen⁴, Xianyang Zhang⁵

Affiliations

¹ Shanghai University of Finance and Economics, Shanghai, 200437, China.
² Texas A&M University, College Station, 77843, USA.
³ Renmin University of China, Beijing, 100872, China.
⁴ Mayo Clinic, Rochester, USA. chen.jun2@mayo.edu.
⁵ Texas A&M University, College Station, 77843, USA. zhangxiany@stat.tamu.edu.

Abstract

Differential abundance analysis is at the core of statistical analysis of microbiome data. The compositional nature of microbiome sequencing data makes false positive control challenging. Here, we show that the compositional effects can be addressed by a simple, yet highly flexible and scalable, approach. The proposed method, LinDA, only requires fitting linear regression models on the centered log-ratio transformed data, and correcting the bias due to compositional effects. We show that LinDA enjoys asymptotic FDR control and can be extended to mixed-effect models for correlated microbiome data. Using simulations and real examples, we demonstrate the effectiveness of LinDA.

Keywords: Compositional effect; Differential abundance analysis; False discovery rate; Multiple testing.

Publication types

Research Support, U.S. Gov't, Non-P.H.S.
Research Support, N.I.H., Extramural

MeSH terms

Animals
Coleoptera*
Linear Models
Microbiota*
Research Design

Abstract

Publication types

MeSH terms

Grants and funding