GMB: an efficient query processor for biological data

Kamal Taha; Ramez Elmasri

doi:10.2390/biecoll-jib-2011-165

GMB: an efficient query processor for biological data

J Integr Bioinform. 2011 Aug 31;8(2):165. doi: 10.2390/biecoll-jib-2011-165.

Authors

Kamal Taha¹, Ramez Elmasri

Affiliation

¹ Khalifa University of Science, Technology & Research, Abu Dhabi, UAE. kamal.taha@kustar.ac.ae

PMID: 21881166
DOI: 10.2390/biecoll-jib-2011-165

Abstract

Bioinformatics applications manage complex biological data stored into distributed and often heterogeneous databases and require large computing power. These databases are too big and complicated to be rapidly queried every time a user submits a query, due to the overhead involved in decomposing the queries, sending the decomposed queries to remote databases, and composing the results. There is also considerable communication costs involved. This study addresses the mentioned problems in Grid-based environment for bioinformatics. We propose a Grid middleware called GMB that alleviates these problems by caching the results of Frequently Used Queries (FUQ). Queries are classified based on their types and frequencies. FUQ are answered from the middleware, which improves their response time. GMB acts as a gateway to TeraGrid Grid: it resides between users’ applications and TeraGrid Grid. We evaluate GMB experimentally.

MeSH terms

Programming Languages
Search Engine / methods*
Software*
Statistics as Topic*
Time Factors