PANDORA: keyword-based analysis of protein sets by integration of annotation sources

Noam Kaplan; Avishay Vaaknin; Michal Linial

doi:10.1093/nar/gkg769

Nucleic Acids Res. 2003 Oct 1;31(19):5617-26. doi: 10.1093/nar/gkg769.

Authors

Noam Kaplan¹, Avishay Vaaknin, Michal Linial

Affiliation

¹ Department of Biological Chemistry, Institute of Life Sciences, The Hebrew University, Jerusalem 91904, Israel.

Abstract

Recent advances in high-throughput methods and the application of computational tools for automatic classification of proteins have made it possible to carry out large-scale proteomic analyses. Biological analysis and interpretation of sets of proteins is a time-consuming undertaking carried out manually by experts. We have developed PANDORA (Protein ANnotation Diagram ORiented Analysis), a web-based tool that provides an automatic representation of the biological knowledge associated with any set of proteins. PANDORA uses a unique approach of keyword-based graphical analysis that focuses on detecting subsets of proteins that share unique biological properties and the intersections of such sets. PANDORA currently supports SwissProt keywords, NCBI Taxonomy, InterPro entries and the hierarchical classification terms from the ENZYME, SCOP and GO databases. Studying several annotation sources simultaneously allows a representation of biological relations of structure, function, cellular location, taxonomy, domains and motifs. PANDORA is also integrated into the ProtoNet system, allowing thousands of automatically generated clusters to be tested. We illustrate how PANDORA enhances the biological understanding of large, non-uniform sets of proteins originating from experimental and computational sources, without the need for prior biological knowledge of the individual proteins.
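
The keyword-based idea described in the abstract (subsets of proteins that share an annotation, and the intersections of such subsets) can be illustrated with a minimal Python sketch. This is not PANDORA's implementation, which the abstract does not specify. The protein identifiers, the keywords and the function names keyword_sets and pairwise_intersections are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical toy annotations (not real data): protein ID -> set of keywords.
annotations = {
    "P1": {"Kinase", "Membrane"},
    "P2": {"Kinase", "Nucleus"},
    "P3": {"Membrane", "Transport"},
    "P4": {"Kinase", "Membrane", "Transport"},
}


def keyword_sets(annotations):
    """Invert protein -> keywords into keyword -> set of proteins."""
    by_keyword = defaultdict(set)
    for protein, keywords in annotations.items():
        for keyword in keywords:
            by_keyword[keyword].add(protein)
    return dict(by_keyword)


def pairwise_intersections(by_keyword):
    """Return the non-empty intersection for every pair of keyword sets."""
    result = {}
    for a, b in combinations(sorted(by_keyword), 2):
        shared = by_keyword[a] & by_keyword[b]
        if shared:
            result[(a, b)] = shared
    return result


by_keyword = keyword_sets(annotations)
overlaps = pairwise_intersections(by_keyword)

for keyword in sorted(by_keyword):
    print(keyword, sorted(by_keyword[keyword]))
for pair in sorted(overlaps):
    print(pair, sorted(overlaps[pair]))
```

Each keyword defines a subset of the input proteins, and the pairwise intersections indicate which annotations tend to co-occur. A real analysis would draw the keywords from sources such as those listed in the abstract rather than from a hand-written dictionary.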

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Computational Biology / methods*
Computer Graphics
Databases, Protein
Internet
Proteins / chemistry
Proteins / classification*
Proteins / physiology
Software*
Systems Integration
Terminology as Topic

Substances

Proteins