PTGL: a database for secondary structure-based protein topologies

Nucleic Acids Res. 2010 Jan;38(Database issue):D326-30. doi: 10.1093/nar/gkp980. Epub 2009 Nov 11.

Abstract

With growing amount of experimental data, the number of known protein structures also increases continuously. Classification of protein structures helps to understand relationships between protein structure and function. The main classification methods based on secondary structures are SCOP, CATH and TOPS, which all classify under different aspects, and therefore can lead to different results. We developed a mathematically unique representation of protein structure topologies at a higher abstraction level providing new aspects of classification and enabling for a fast search through the data. Protein Topology Graph Library (PTGL; http://ptgl.zib.de) aims at providing a database on protein secondary structure topologies, including search facilities, the visualization as intuitive topology diagrams as well as in the 3D structure, and additional information. Secondary structure-based protein topologies are represented uniquely as undirected labeled graphs in four different ways allowing for exploration under different aspects. The linear notations, and the 2D and 3D diagrams of each notation facilitate a deeper understanding of protein topologies. Several search functions for topologies and sub-topologies, BLAST search possibility, and links to SCOP, CATH and PDBsum support individual and large-scale investigation of protein structures. Currently, PTGL comprises topologies of 54,859 protein structures. Main structural patterns for common structural motifs like TIM-barrel or Jelly Roll are pre-implemented, and can easily be searched.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Motifs
  • Animals
  • Computational Biology / methods*
  • Computational Biology / trends
  • Computer Graphics
  • Crystallography, X-Ray / methods
  • Databases, Genetic*
  • Databases, Protein*
  • Humans
  • Information Storage and Retrieval / methods
  • Internet
  • Protein Conformation
  • Protein Structure, Secondary*
  • Proteins / chemistry*
  • Software

Substances

  • Proteins