The Presence of Genotoxic and/or Pro-inflammatory Bacterial Genes in Gut Metagenomic Databases and Their Possible Link With Inflammatory Bowel Diseases

Front Genet. 2018 Apr 10:9:116. doi: 10.3389/fgene.2018.00116. eCollection 2018.

Abstract

Background: The human gut microbiota is a dynamic community of microorganisms that mediate important biochemical processes. Differences in the gut microbial composition have been associated with inflammatory bowel diseases (IBD) and other intestinal disorders. In this study, we quantified and compared the frequencies of eight genotoxic and/or pro-inflammatory bacterial genes found in metagenomic Whole Genome Sequences (mWGSs) of samples from individuals with IBD vs. a cohort of healthy human subjects. Methods: The eight selected gene sequences were clbN, clbB, cif, cnf-1, usp, tcpC from Escherichia coli, gelE from Enterococcus faecalis and murB from Akkermansia muciniphila. We also included the sequences for the conserved murB genes from E. coli and E. faecalis as markers for the presence of Enterobacteriaceae or Enterococci in the samples. The gene sequences were chosen based on their previously reported ability to disrupt normal cellular processes to either promote inflammation or to cause DNA damage in cultured cells or animal models, which could be linked to a role in IBD. The selected sequences were searched in three different mWGS datasets accessed through the Human Microbiome Project (HMP): a healthy cohort (N = 251), a Crohn's disease cohort (N = 60) and an ulcerative colitis cohort (N = 17). Results: Firstly, the sequences for the murB housekeeping genes from Enterobacteriaceae and Enterococci were more frequently found in the IBD cohorts (32% E. coli in IBD vs. 12% in healthy; 13% E. faecalis in IBD vs. 3% in healthy) than in the healthy cohort, confirming earlier reports of a higher presence of both of these taxa in IBD. For some of the sequences in our study, especially usp and gelE, their frequency was even more sharply increased in the IBD cohorts than in the healthy cohort, suggesting an association with IBD that is not easily explained by the increased presence of E. coli or E. faecalis in those samples. Conclusion: Our results suggest a significant association between the presence of some of these genotoxic or pro-inflammatory gene sequences and IBDs. In addition, these results illustrate the power and limitations of the HMP database in the detection of possible clinical correlations for individual bacterial genes.

Keywords: bioinformatics; gelE; human microbiome; inflammatory bowel diseases; metagenomics; next-generation sequencing data; usp.