Machine-learning analysis of cross-study samples according to the gut microbiome in 12 infant cohorts

mSystems. 2023 Dec 21;8(6):e0036423. doi: 10.1128/msystems.00364-23. Epub 2023 Oct 24.

Abstract

There are challenges in merging microbiome data from diverse research groups due to the intricate and multifaceted nature of such data. To address this, we utilized a combination of machine-learning (ML) models to analyze 16S sequencing data from a substantial set of gut microbiome samples, sourced from 12 distinct infant cohorts that were gathered prospectively. Our initial focus was on the mode of delivery due to its prior association with changes in infant gut microbiomes. Through ML analysis, we demonstrated the effective merging and comparison of various gut microbiome data sets, facilitating the identification of robust microbiome biomarkers applicable across varied study populations.

Keywords: bioinformatics; children; cross-study; ensemble; gut microbiome; human microbiome; infant; machine learning; random forest.

MeSH terms

  • Feces
  • Gastrointestinal Microbiome* / genetics
  • Humans
  • Infant
  • Machine Learning
  • Microbiota*
  • RNA, Ribosomal, 16S / genetics

Substances

  • RNA, Ribosomal, 16S