Evaluation of similarity measures for gene expression data and their correspondent combined measures

Interdiscip Sci. 2009 Mar;1(1):72-80. doi: 10.1007/s12539-008-0005-3. Epub 2009 Jun 10.

Abstract

It is commonly considered that genes with similar expression profiles are functional related. And there are many ways to measure the similarity of gene expression data. Especially with the advent of lots of biological information, new combined measures have been constructed by combining the biological information with different similarity measures. However, it is not clear that what is the most suitable and effective measure for gene expression data and what is the most suitable measure to construct the most effective combined measure. In this paper, several similarity measures are analyzed and two new similarity measures are proposed. Their correspondent combined measures are also constructed by incorporating Gene Ontology annotations. All these measures are evaluated by their effectiveness in detecting functionally links in several different gene expression data by comparing with the protein-protein interaction database. The results show that the newly proposed measures and their correspondent combined measures are very effective and suitable for different datasets. And our methodology is applicable to evaluate new similarity measures and detect the best measure for a certain dataset.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Databases, Genetic*
  • Evaluation Studies as Topic*
  • Gene Expression Regulation, Fungal*
  • Reproducibility of Results
  • Saccharomyces cerevisiae / genetics*
  • Saccharomyces cerevisiae Proteins / genetics
  • Saccharomyces cerevisiae Proteins / metabolism

Substances

  • Saccharomyces cerevisiae Proteins