Comparative GO: a web application for comparative gene ontology and gene ontology-based gene selection in bacteria

PLoS One. 2013;8(3):e58759. doi: 10.1371/journal.pone.0058759. Epub 2013 Mar 11.

Abstract

The primary means of classifying new functions for genes and proteins relies on Gene Ontology (GO), which defines genes/proteins using a controlled vocabulary in terms of their Molecular Function, Biological Process and Cellular Component. The challenge is to present this information to researchers to compare and discover patterns in multiple datasets using visually comprehensible and user-friendly statistical reports. Importantly, while there are many GO resources available for eukaryotes, there are none suitable for simultaneous, graphical and statistical comparison between multiple datasets. In addition, none of them supports comprehensive resources for bacteria. By using Streptococcus pneumoniae as a model, we identified and collected GO resources including genes, proteins, taxonomy and GO relationships from NCBI, UniProt and GO organisations. Then, we designed database tables in PostgreSQL database server and developed a Java application to extract data from source files and loaded into database automatically. We developed a PHP web application based on Model-View-Control architecture, used a specific data structure as well as current and novel algorithms to estimate GO graphs parameters. We designed different navigation and visualization methods on the graphs and integrated these into graphical reports. This tool is particularly significant when comparing GO groups between multiple samples (including those of pathogenic bacteria) from different sources simultaneously. Comparing GO protein distribution among up- or down-regulated genes from different samples can improve understanding of biological pathways, and mechanism(s) of infection. It can also aid in the discovery of genes associated with specific function(s) for investigation as a novel vaccine or therapeutic targets.

Availability: http://turing.ersa.edu.au/BacteriaGO.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Bacteria / genetics*
  • Bacteria / metabolism*
  • Computational Biology / methods
  • Databases, Genetic
  • Gene Ontology*
  • Internet*
  • Software*
  • User-Computer Interface

Grants and funding

This work was supported by the National Health and Medical Research Council of Australia (NHMRC) Project Grant 627142 to JCP and ADO, and NHMRC Program Grant 565526 to JCP. MF is a recipient of School Divisional PhD Scholarship from The University of Adelaide. JCP is a NHMRC Australia Fellow. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.