Linear programming based gene expression model (LPM-GEM) predicts the carbon source for Bacillus subtilis

BMC Bioinformatics. 2022 Jun 10;23(1):226. doi: 10.1186/s12859-022-04742-7.

Abstract

Background: Elucidating cellular metabolism led to many breakthroughs in biotechnology, synthetic biology, and health sciences. To date, deriving metabolic fluxes by 13C tracer experiments is the most prominent approach for studying metabolic fluxes quantitatively, often with high accuracy and precision. However, the technique has a high demand for experimental resources. Alternatively, flux balance analysis (FBA) has been employed to estimate metabolic fluxes without labeling experiments. It is less informative but can benefit from the low costs and low experimental efforts and gain flux estimates in experimentally difficult conditions. Methods to integrate relevant experimental data have been emerged to improve FBA flux estimations. Data from transcription profiling is often selected since it is easy to generate at the genome scale, typically embedded by a discretization of differential and non-differential expressed genes coding for the respective enzymes.

Result: We established the novel method Linear Programming based Gene Expression Model (LPM-GEM). LPM-GEM linearly embeds gene expression into FBA constraints. We implemented three strategies to reduce thermodynamically infeasible loops, which is a necessary prerequisite for such an omics-based model building. As a case study, we built a model of B. subtilis grown in eight different carbon sources. We obtained good flux predictions based on the respective transcription profiles when validating with 13C tracer based metabolic flux data of the same conditions. We could well predict the specific carbon sources. When testing the model on another, unseen dataset that was not used during training, good prediction performance was also observed. Furthermore, LPM-GEM outperformed a well-established model building methods.

Conclusion: Employing LPM-GEM integrates gene expression data efficiently. The method supports gene expression-based FBA models and can be applied as an alternative to estimate metabolic fluxes when tracer experiments are inappropriate.

Keywords: Bacillus subtilis; Carbon source; Constraint-based modeling; Flux balance analysis; Mixed-integer linear programming; Thermodynamically infeasible loops; Transcriptomics.

MeSH terms

  • Bacillus subtilis* / genetics
  • Bacillus subtilis* / metabolism
  • Carbon* / metabolism
  • Gene Expression
  • Metabolic Networks and Pathways
  • Models, Biological
  • Programming, Linear

Substances

  • Carbon