How can functional annotations be derived from profiles of phenotypic annotations?

BMC Bioinformatics. 2017 Feb 10;18(1):96. doi: 10.1186/s12859-017-1503-5.

Abstract

Background: Loss-of-function phenotypes are widely used to infer gene function using the principle that similar phenotypes are indicative of similar functions. However, converting phenotypic to functional annotations requires careful interpretation of phenotypic descriptions and assessment of phenotypic similarity. Understanding how functions and phenotypes are linked will be crucial for the development of methods for the automatic conversion of gene loss-of-function phenotypes to gene functional annotations.

Results: We explored the relation between cellular phenotypes from RNAi-based screens in human cells and gene annotations of cellular functions as provided by the Gene Ontology (GO). Comparing different similarity measures, we found that information content-based measures of phenotypic similarity were the best at capturing gene functional similarity. However, phenotypic similarities did not map to the Gene Ontology organization of gene function but to functions defined as groups of GO terms with shared gene annotations.

Conclusions: Our observations have implications for the use and interpretation of phenotypic similarities as a proxy for gene functions both in RNAi screen data analysis and curation and in the prediction of disease genes.

Keywords: Biological network; Cellular phenotype; Cluster analysis; Ontology.

MeSH terms

  • Area Under Curve
  • Cluster Analysis
  • Computational Biology / methods*
  • Humans
  • Phenotype
  • RNA Interference
  • ROC Curve