A Review and Tutorial of Machine Learning Methods for Microbiome Host Trait Prediction

Front Genet. 2019 Jun 25:10:579. doi: 10.3389/fgene.2019.00579. eCollection 2019.

Abstract

With the growing importance of microbiome research, there is increasing evidence that host variation in microbial communities is associated with overall host health. Advancement in genetic sequencing methods for microbiomes has coincided with improvements in machine learning, with important implications for disease risk prediction in humans. One aspect specific to microbiome prediction is the use of taxonomy-informed feature selection. In this review for non-experts, we explore the most commonly used machine learning methods, and evaluate their prediction accuracy as applied to microbiome host trait prediction. Methods are described at an introductory level, and R/Python code for the analyses is provided.

Keywords: disease; machine learning; modeling; phenotype; prediction.

Publication types

  • Review