XML-based data model and architecture for a knowledge-based grid-enabled problem-solving environment for high-throughput biological imaging

IEEE Trans Inf Technol Biomed. 2008 Mar;12(2):226-40. doi: 10.1109/TITB.2007.904153.

Abstract

High-throughput biological imaging uses automated imaging devices to collect a large number of microscopic images for analysis of biological systems and validation of scientific hypotheses. Efficient manipulation of these datasets for knowledge discovery requires high-performance computational resources, efficient storage, and automated tools for extracting and sharing such knowledge among different research sites. Newly emerging grid technologies provide powerful means for exploiting the full potential of these imaging techniques. Efficient utilization of grid resources requires the development of knowledge-based tools and services that combine domain knowledge with analysis algorithms. In this paper, we first investigate how grid infrastructure can facilitate high-throughput biological imaging research, and present an architecture for providing knowledge-based grid services for this field. We identify two levels of knowledge-based services. The first level provides tools for extracting spatiotemporal knowledge from image sets and the second level provides high-level knowledge management and reasoning services. We then present cellular imaging markup language, an extensible markup language-based language for modeling of biological images and representation of spatiotemporal knowledge. This scheme can be used for spatiotemporal event composition, matching, and automated knowledge extraction and representation for large biological imaging datasets. We demonstrate the expressive power of this formalism by means of different examples and extensive experimental results.

MeSH terms

  • Artificial Intelligence*
  • Database Management Systems*
  • Humans
  • Image Interpretation, Computer-Assisted / methods
  • Information Dissemination / methods
  • Information Storage and Retrieval / methods*
  • Internet*
  • Models, Theoretical
  • Pattern Recognition, Automated / methods
  • Programming Languages*
  • Radiology Information Systems*
  • United States