Computing autocatalytic sets to unravel inconsistencies in metabolic network reconstructions

Bioinformatics. 2015 Feb 1;31(3):373-81. doi: 10.1093/bioinformatics/btu658. Epub 2014 Oct 5.

Abstract

Motivation: Genome-scale metabolic network reconstructions have been established as a powerful tool for the prediction of cellular phenotypes and metabolic capabilities of organisms. In recent years, the number of network reconstructions has been constantly increasing, mostly because of the availability of novel (semi-)automated procedures, which enabled the reconstruction of metabolic models based on individual genomes and their annotation. The resulting models are widely used in numerous applications. However, the accuracy and predictive power of network reconstructions are commonly limited by inherent inconsistencies and gaps.

Results: Here we present a novel method to validate metabolic network reconstructions based on the concept of autocatalytic sets. Autocatalytic sets correspond to collections of metabolites that, besides enzymes and a growth medium, are required to produce all biomass components in a metabolic model. These autocatalytic sets are well-conserved across all domains of life, and their identification in specific genome-scale reconstructions allows us to draw conclusions about potential inconsistencies in these models. The method is capable of detecting inconsistencies, which are neglected by other gap-finding methods. We tested our method on the Model SEED, which is the largest repository for automatically generated genome-scale network reconstructions. In this way, we were able to identify a significant number of missing pathways in several of these reconstructions. Hence, the method we report represents a powerful tool to identify inconsistencies in large-scale metabolic networks.

Availability and implementation: The method is available as source code on http://users.minet.uni-jena.de/∼m3kach/ASBIG/ASBIG.zip.

Contact: christoph.kaleta@uni-jena.de

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bacterial Proteins / metabolism*
  • Catalytic Domain
  • Computational Biology*
  • Genome, Bacterial / genetics*
  • Metabolic Networks and Pathways / genetics*
  • Models, Biological
  • Phenotype
  • Software*

Substances

  • Bacterial Proteins