Estimation of metabolite networks with regard to a specific covariable: applications to plant and human data

Metabolomics. 2017;13(11):129. doi: 10.1007/s11306-017-1263-2. Epub 2017 Sep 22.

Abstract

Introduction: In systems biology, where a main goal is to acquire knowledge of biological systems, one of the challenges is inferring biochemical interactions among molecular entities such as metabolites. In this area, the metabolome holds a unique place in reflecting "true exposure", because it is sensitive to variation arising from genetics, time, and environmental stimuli. Although the metabolome is influenced by many different reactions, research interest often needs to focus on variation arising from one particular source, i.e. a certain covariable [Formula: see text].

Objective: Here, we use network analysis methods to recover a set of metabolite relationships by finding metabolites that share a similar relation to [Formula: see text]. Metabolite values are based on information coming from individuals' [Formula: see text] status, which might interact with other covariables.

Methods: As an alternative to using the original metabolite values, the total information is decomposed with a linear regression model, and only the part relevant to [Formula: see text] is used further. For two datasets, two different network estimation methods are considered. The first is weighted gene co-expression network analysis, which is based on correlation coefficients. The second is the graphical LASSO, which is based on partial correlations.
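The decomposition step described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the data are simulated, the names `covariable_part` and `soft_threshold_adjacency` are hypothetical, the covariable of interest is taken to be a binary status `x` that interacts with one other covariable `z`, and the soft-threshold power `beta=6` is an illustrative choice. The adjacency is a WGCNA-style unsigned one, |correlation| raised to a power.

```python
import numpy as np


def covariable_part(Y, x, z):
    """Fitted part of each metabolite attributable to x.

    Assumed model (per metabolite): y = b0 + b1*x + b2*z + b3*(x*z) + error.
    The returned part is b1*x + b3*(x*z), i.e. the main effect of x plus
    its interaction with z.
    """
    n = Y.shape[0]
    X = np.column_stack([np.ones(n), x, z, x * z])  # design matrix
    B, *_ = np.linalg.lstsq(X, Y, rcond=None)       # OLS, one column per metabolite
    x_cols = [1, 3]                                 # columns involving x
    return X[:, x_cols] @ B[x_cols, :]


def soft_threshold_adjacency(M, beta=6):
    """Unsigned correlation adjacency |cor|^beta with a zero diagonal."""
    C = np.corrcoef(M, rowvar=False)
    A = np.abs(C) ** beta
    np.fill_diagonal(A, 0.0)
    return A


# Simulated example: n individuals, p metabolites (hypothetical data).
rng = np.random.default_rng(0)
n, p = 200, 5
x = rng.integers(0, 2, size=n).astype(float)  # binary status of interest
z = rng.normal(size=n)                        # another covariable
Y = (np.outer(x, rng.normal(size=p))
     + np.outer(x * z, rng.normal(size=p))
     + rng.normal(size=(n, p)))

part = covariable_part(Y, x, z)           # part of Y related to x
A_part = soft_threshold_adjacency(part)   # network from the x-related part
A_raw = soft_threshold_adjacency(Y)       # network from the original values
```

The two adjacency matrices can then be compared, for example by their mean off-diagonal weight. A partial-correlation network could be estimated from the same `part` matrix with a graphical LASSO implementation such as scikit-learn's `GraphicalLasso`, although a low-rank `part` matrix, as in this sketch, may require stronger regularization.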

Results: We observed that, when only the parts related to the covariable of interest are used, the resulting estimated networks display higher interconnectedness. Additionally, several groups of biologically associated metabolites (very large density lipoproteins, lipoproteins, etc.) were identified in the human data example.

Conclusions: This work demonstrates how information on the study design can be incorporated to estimate metabolite networks. As a result, sets of interconnected metabolites can be clustered together with respect to their relation to a covariable of interest.

Keywords: Incorporating relevant information; Metabolites; Network reconstruction; Study design.