Metabolomic network analysis of estrogen-stimulated MCF-7 cells: a comparison of overrepresentation analysis, quantitative enrichment analysis and pathway analysis versus metabolite network analysis

Arch Toxicol. 2017 Jan;91(1):217-230. doi: 10.1007/s00204-016-1695-x. Epub 2016 Apr 2.

Abstract

In the context of the Human Toxome project, mass spectroscopy-based metabolomics characterization of estrogen-stimulated MCF-7 cells was studied in order to support the untargeted deduction of pathways of toxicity. A targeted and untargeted approach using overrepresentation analysis (ORA), quantitative enrichment analysis (QEA) and pathway analysis (PA) and a metabolite network approach were compared. Any untargeted approach necessarily has some noise in the data owing to artifacts, outliers and misidentified metabolites. Depending on the chemical analytical choices (sample extraction, chromatography, instrument and settings, etc.), only a partial representation of all metabolites will be achieved, biased by both the analytical methods and the database used to identify the metabolites. Here, we show on the one hand that using a data analysis approach based exclusively on pathway annotations has the potential to miss much that is of interest and, in the case of misidentified metabolites, can produce perturbed pathways that are statistically significant yet uninformative for the biological sample at hand. On the other hand, a targeted approach, by narrowing its focus and minimizing (but not eliminating) misidentifications, renders the likelihood of a spurious pathway much smaller, but the limited number of metabolites also makes statistical significance harder to achieve. To avoid an analysis dependent on pathways, we built a de novo network using all metabolites that were different at 24 h with and without estrogen with a p value <0.01 (53) in the STITCH database, which links metabolites based on known reactions in the main metabolic network pathways but also based on experimental evidence and text mining. The resulting network contained a "connected component" of 43 metabolites and helped identify non-endogenous metabolites as well as pathways not visible by annotation-based approaches. Moreover, the most highly connected metabolites (energy metabolites such as pyruvate and alpha-ketoglutarate, as well as amino acids) showed only a modest change between proliferation with and without estrogen. Here, we demonstrate that estrogen has subtle but potentially phenotypically important alterations in the acyl-carnitine fatty acids, acetyl-putrescine and succinoadenosine, in addition to likely subtle changes in key energy metabolites that, however, could not be verified consistently given the technical limitations of this approach. Finally, we show that a network-based approach combined with text mining identifies pathways that would otherwise neither be considered statistically significant on their own nor be identified via ORA, QEA, or PA.

Keywords: Bioinformatics; Computational toxicology; Endocrine disruption; Estrogen; Metabolomics; Pathway analysis.

Publication types

  • Comparative Study

MeSH terms

  • Chromatography, High Pressure Liquid
  • Computational Biology
  • Data Mining
  • Databases, Factual
  • Endocrine Disruptors / pharmacology
  • Energy Metabolism / drug effects*
  • Estradiol / pharmacology*
  • Estrogens / pharmacology*
  • Humans
  • MCF-7 Cells
  • Metabolome / drug effects*
  • Metabolomics / methods*
  • Models, Biological*
  • Reproducibility of Results
  • Secondary Metabolism / drug effects*
  • Spectrometry, Mass, Electrospray Ionization
  • Toxicology / methods*

Substances

  • Endocrine Disruptors
  • Estrogens
  • Estradiol