A chemical structure and machine learning approach to assess the potential bioactivity of endogenous metabolites and their association with early-childhood hs-CRP levels

bioRxiv [Preprint]. 2023 Nov 16:2023.11.15.567095. doi: 10.1101/2023.11.15.567095.

Abstract

Metabolomics has gained much attraction due to its potential to reveal molecular disease mechanisms and present viable biomarkers. In this work we used a panel of untargeted serum metabolomes in 602 childhood patients of the COPSAC2010 mother-child cohort. The annotated part of the metabolome consists of 493 chemical compounds curated using automated procedures. Using predicted quantitative-structure-bioactivity relationships for the Tox21 database on nuclear receptors and stress response in cell lines, we created a filtering method for the vast number of quantified metabolites. The metabolites measured in children's serums used here have predicted potential against the chosen target modelled targets. The targets from Tox21 have been used with quantitative structure-activity relationships (QSARs) and were trained for ~7000 structures, saved as models, and then applied to 493 metabolites to predict their potential bioactivities. The models were selected based on strict accuracy criteria surpassing random effects. After application, 52 metabolites showed potential bioactivity based on structural similarity with known active compounds from the Tox21 set. The filtered compounds were subsequently used and weighted by their bioactive potential to show an association with early childhood hs-CRP levels at six months in a linear model supporting a physiological adverse effect on systemic low-grade inflammation. The significant metabolites were reported.

Keywords: corticol; cortisone; inflammation; metabolomics; qsar; vitamin a.

Publication types

  • Preprint