Discovery of uncharacterized cellular systems by genome-wide analysis of functional linkages

Nat Biotechnol. 2003 Sep;21(9):1055-62. doi: 10.1038/nbt861. Epub 2003 Aug 17.

Abstract

We introduce a general computational method, applicable on a genome-wide scale, for the systematic discovery of uncharacterized cellular systems. Quantitative analysis of the coinheritance of pairs of genes among different organisms, calculated using phylogenetic profiles, allows the prediction of thousands of functional linkages between the corresponding proteins. A comparison of these functional linkages to known pathways reveals that calculated linkages are comparable in accuracy to genome-wide yeast two-hybrid screens or mass spectrometry interaction assays. In aggregate, these linkages describe the structure of large-scale networks, with the resulting yeast network composed of 3,875 linkages among 804 proteins, and the resulting pathogenic Escherichia coli network composed of 2,043 linkages among 828 proteins. The search of such networks for groups of uncharacterized, linked proteins led to the identification of 27 novel cellular systems from one nonpathogenic and three pathogenic bacterial genomes.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Validation Study

MeSH terms

  • Algorithms*
  • Bacterial Proteins / genetics
  • Bacterial Proteins / metabolism
  • Chromosome Mapping / methods*
  • Cluster Analysis
  • Energy Metabolism / physiology
  • Escherichia coli / genetics
  • Escherichia coli / metabolism
  • Gene Expression Profiling / methods*
  • Gene Expression Regulation / physiology*
  • Genome*
  • Phylogeny*
  • Proteome / genetics*
  • Proteome / metabolism*
  • Sequence Alignment
  • Sequence Analysis, Protein / methods
  • Sequence Homology, Amino Acid
  • Signal Transduction / physiology

Substances

  • Bacterial Proteins
  • Proteome