GMB: an efficient query processor for biological data

J Integr Bioinform. 2011 Aug 31;8(2):165. doi: 10.2390/biecoll-jib-2011-165.

Abstract

Bioinformatics applications manage complex biological data stored in distributed and often heterogeneous databases and require large computing power. These databases are too large and complex to be queried rapidly every time a user submits a query, because of the overhead involved in decomposing the query, sending the resulting subqueries to remote databases, and composing the results. Considerable communication costs are also involved. This study addresses these problems in a Grid-based environment for bioinformatics. We propose a Grid middleware called GMB that alleviates them by caching the results of Frequently Used Queries (FUQ). Queries are classified by type and frequency. FUQ are answered directly from the middleware, which improves their response time. GMB acts as a gateway to the TeraGrid: it resides between users’ applications and the TeraGrid. We evaluate GMB experimentally.
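
The caching idea in the abstract can be illustrated with a minimal Python sketch. It is not GMB's implementation, which the abstract does not detail. The class name FrequentQueryCache, the backend callable, the threshold parameter, and the whitespace/case normalization are all hypothetical choices made for illustration. Classification by query type is omitted, so only frequency is modeled.

```python
from collections import Counter


class FrequentQueryCache:
    """Toy middleware cache that keeps results only for frequently used queries.

    All names here are hypothetical; this is not GMB's actual design.
    """

    def __init__(self, backend, threshold=3):
        self.backend = backend      # callable that runs a query remotely (assumed)
        self.threshold = threshold  # assumed cutoff for "frequently used"
        self.counts = Counter()     # times each normalized query has been seen
        self.cache = {}             # stored results for frequent queries

    @staticmethod
    def normalize(query):
        # Simplification: collapse whitespace and case so that trivially
        # different spellings of one query share a key.
        return " ".join(query.lower().split())

    def run(self, query):
        key = self.normalize(query)
        self.counts[key] += 1
        if key in self.cache:
            return self.cache[key]        # answered from the middleware
        result = self.backend(query)      # forwarded to the remote databases
        if self.counts[key] >= self.threshold:
            self.cache[key] = result      # query is now frequent: keep its result
        return result
```

Once a query has been seen threshold times, later submissions are served from the cache without contacting the backend. A real middleware would also need an invalidation policy for when the underlying databases change, which this sketch leaves out.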

MeSH terms

  • Programming Languages
  • Search Engine / methods*
  • Software*
  • Statistics as Topic*
  • Time Factors