Identifying significant associations of orthologous simple sequence repeats with gene ontologies

Int J Data Min Bioinform. 2014;9(1):37-51. doi: 10.1504/ijdmb.2014.057781.

Abstract

Simple Sequence Repeats (SSRs), also known as microsatellites, regulate gene functions. SSR mutations in a disease gene may cause various genetic disorders. To identify putative functional SSRs, a web-based system, Gene Ontology SSR Hierarchy (GOSH), was developed to facilitate the discovery of significant associations between SSRs and Gene Ontology (GO) terms. Using the hierarchical structure of GO terms, GOSH helps users select functional or biological gene subsets. Significant SSR patterns are identified through comprehensive overrepresentation analysis within a target gene subset and by comparing the results with those for orthologous genes. Relationships between patterns in different biological subsets or supersets can be observed directly through the GO hierarchy. GOSH also supports searching GO by identified significant SSR patterns, listing all GO terms that possess such patterns for consultation. GOSH is the first comprehensive and efficient online mining tool for discovering significant orthologous SSR patterns in GO terms and is available at http://gosh.cs.ntou.edu.tw/.
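The abstract does not say which statistic GOSH uses for its overrepresentation analysis. As an illustration only, the sketch below assumes a one-sided hypergeometric test, a common choice for GO enrichment, and a simple regular-expression scan for perfect tandem repeats. The function names (find_ssrs, hypergeom_upper_tail), the repeat thresholds, and the gene counts in the usage lines are hypothetical and are not taken from GOSH.

```python
import re
from math import comb


def find_ssrs(seq, min_unit=1, max_unit=6, min_repeats=3):
    """Find perfect tandem repeats in a DNA sequence.

    Returns a list of (unit, repeat_count, start) tuples. The thresholds
    are illustrative assumptions, not GOSH's actual parameters.
    """
    # Lazy unit match so the shortest repeating unit is reported
    # (e.g. "AC" rather than "ACAC"), followed by at least
    # min_repeats - 1 further copies of the same unit.
    pattern = re.compile(
        r"([ACGT]{%d,%d}?)\1{%d,}" % (min_unit, max_unit, min_repeats - 1)
    )
    hits = []
    for m in pattern.finditer(seq.upper()):
        unit = m.group(1)
        hits.append((unit, len(m.group(0)) // len(unit), m.start()))
    return hits


def hypergeom_upper_tail(k, N, K, n):
    """P(X >= k) for a hypergeometric distribution.

    N: genes in the background set
    K: background genes carrying the SSR pattern
    n: genes annotated with the GO term of interest
    k: genes in that GO subset carrying the SSR pattern
    """
    upper = min(K, n)
    favourable = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return favourable / comb(N, n)


# Hypothetical usage: scan one sequence, then test one pattern in one GO subset.
repeats = find_ssrs("GGACACACACTT")
p_value = hypergeom_upper_tail(k=8, N=20000, K=400, n=50)
print(repeats, p_value)
```

A small p-value here would suggest that the pattern occurs in the GO subset more often than expected by chance. In practice, testing many patterns across many GO terms would also call for a multiple-testing correction, which this sketch omits.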

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Data Mining / methods*
  • Database Management Systems*
  • Databases, Genetic*
  • Gene Ontology*
  • Microsatellite Repeats / genetics*
  • Natural Language Processing*
  • Sequence Analysis, DNA / methods*
  • Sequence Homology*