CLOE: identification of putative functional relationships among genes by comparison of expression profiles between two species

BMC Bioinformatics. 2004 Nov 19:5:179. doi: 10.1186/1471-2105-5-179.

Abstract

Background: Public repositories of microarray data contain an incredible amount of information that is potentially relevant to explore functional relationships among genes by meta-analysis of expression profiles. However, the widespread use of this resource by the scientific community is at the moment limited by the limited availability of effective tools of analysis. We here describe CLOE, a simple cDNA microarray data mining strategy based on meta-analysis of datasets from pairs of species. The method consists in ranking EST probes in the datasets of the two species according to the similarity of their expression profiles with that of two EST probes from orthologous genes, and extracting orthologous EST pairs from a given top interval of the ranked lists. The Gene Ontology annotation of the obtained candidate partners is then analyzed for keywords overrepresentation.

Results: We demonstrate the capabilities of the approach by testing its predictive power on three proteomically-defined mammalian protein complexes, in comparison with single and multiple species meta-analysis approaches. Our results show that CLOE can find candidate partners for a greater number of genes, if compared to multiple species co-expression analysis, but retains a comparable specificity even when applied to species as close as mouse and human. On the other hand, it is much more specific than single organisms co-expression analysis, strongly reducing the number of potential candidate partners for a given gene of interest.

Conclusions: CLOE represents a simple and effective data mining approach that can be easily used for meta-analysis of cDNA microarray experiments characterized by very heterogeneous coverage. Importantly, it produces for genes of interest an average number of high confidence putative partners that is in the range of standard experimental validation techniques.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Computational Biology / methods
  • Computational Biology / statistics & numerical data
  • Conserved Sequence / genetics
  • DNA Probes / genetics
  • DNA, Complementary / genetics
  • Evolution, Molecular
  • Expressed Sequence Tags
  • Gene Expression Profiling / methods*
  • Gene Expression Profiling / statistics & numerical data
  • Gene Expression Regulation / physiology*
  • Humans
  • Mice
  • Predictive Value of Tests
  • Sequence Homology, Nucleic Acid
  • Software

Substances

  • DNA Probes
  • DNA, Complementary