From ontology to semantic similarity: calculation of ontology-based semantic similarity

ScientificWorldJournal. 2013:2013:793091. doi: 10.1155/2013/793091. Epub 2013 Feb 28.

Abstract

Advances in high-throughput experimental techniques in the past decade have enabled the explosive increase of omics data, while effective organization, interpretation, and exchange of these data require standard and controlled vocabularies in the domain of biological and biomedical studies. Ontologies, as abstract description systems for domain-specific knowledge composition, hence receive more and more attention in computational biology and bioinformatics. Particularly, many applications relying on domain ontologies require quantitative measures of relationships between terms in the ontologies, making it indispensable to develop computational methods for the derivation of ontology-based semantic similarity between terms. Nevertheless, with a variety of methods available, how to choose a suitable method for a specific application becomes a problem. With this understanding, we review a majority of existing methods that rely on ontologies to calculate semantic similarity between terms. We classify existing methods into five categories: methods based on semantic distance, methods based on information content, methods based on properties of terms, methods based on ontology hierarchy, and hybrid methods. We summarize characteristics of each category, with emphasis on basic notions, advantages and disadvantages of these methods. Further, we extend our review to software tools implementing these methods and applications using these methods.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Databases, Genetic
  • Genetic Predisposition to Disease / genetics
  • Natural Language Processing
  • Neural Networks, Computer
  • Phenotype
  • Protein Interaction Maps
  • Semantics*
  • Software*
  • Vocabulary, Controlled*