MODBASE, a database of annotated comparative protein structure models

Nucleic Acids Res. 2002 Jan 1;30(1):255-9. doi: 10.1093/nar/30.1.255.

Abstract

MODBASE (http://guitar.rockefeller.edu/modbase) is a relational database of annotated comparative protein structure models for all available protein sequences matched to at least one known protein structure. The models are calculated by MODPIPE, an automated modeling pipeline that relies on PSI-BLAST, IMPALA and MODELLER. MODBASE uses the MySQL relational database management system for flexible and efficient querying, and the MODVIEW Netscape plugin for viewing and manipulating multiple sequences and structures. It is updated regularly to reflect the growth of the protein sequence and structure databases, as well as improvements in the software for calculating the models. For ease of access, MODBASE is organized into different datasets. The largest dataset contains models for domains in 304 517 out of 539 171 unique protein sequences in the complete TrEMBL database (23 March 2001); only models based on significant alignments (PSI-BLAST E-value < 10⁻⁴) and models assessed to have the correct fold are included. Other datasets include models for target selection and structure-based annotation by the New York Structural Genomics Research Consortium, models for prediction of genes in the Drosophila melanogaster genome, models for structure determination of several ribosomal particles and models calculated by the MODWEB comparative modeling web server.
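
The inclusion rule and the coverage figure for the largest dataset can be illustrated with a minimal Python sketch. It is not MODBASE or MODPIPE code: the `Model` record, its field names and both functions are hypothetical, and the rule is read here as a union (a model is kept if its alignment is significant or if its fold is assessed as correct), which is one interpretation of the sentence above. Only the E-value threshold and the two sequence counts come from the abstract.

```python
from dataclasses import dataclass

# PSI-BLAST E-value threshold quoted in the abstract.
E_VALUE_CUTOFF = 1e-4


@dataclass
class Model:
    """Hypothetical model record; field names are illustrative only."""
    sequence_id: str
    psi_blast_e_value: float
    fold_assessed_correct: bool


def is_included(model: Model) -> bool:
    """Assumed rule: keep a model if the alignment is significant
    OR the fold is assessed as correct (union reading of the abstract)."""
    significant = model.psi_blast_e_value < E_VALUE_CUTOFF
    return significant or model.fold_assessed_correct


def coverage_percent(modeled: int, total: int) -> float:
    """Percentage of sequences with at least one modeled domain."""
    return 100.0 * modeled / total


# Invented example records, not real MODBASE entries.
candidates = [
    Model("seqA", 1e-30, False),  # significant alignment
    Model("seqB", 0.5, True),     # weak alignment, fold assessed correct
    Model("seqC", 0.5, False),    # neither criterion met
]
kept = [m.sequence_id for m in candidates if is_included(m)]

# Counts from the abstract: 304 517 of 539 171 TrEMBL sequences.
coverage = coverage_percent(304_517, 539_171)
```

With the counts given in the abstract, `coverage` comes out at roughly 56%, that is, a little over half of the unique TrEMBL sequences have at least one modeled domain.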

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Animals
  • Computer Graphics
  • Database Management Systems
  • Databases, Protein*
  • Drosophila melanogaster / chemistry
  • Drosophila melanogaster / genetics
  • Forecasting
  • Genome
  • Humans
  • Information Storage and Retrieval
  • Internet
  • Models, Molecular*
  • Protein Structure, Tertiary
  • Proteins / chemistry*
  • Proteins / physiology
  • Ribosomal Proteins / chemistry
  • Sequence Alignment
  • Sequence Homology, Amino Acid
  • User-Computer Interface

Substances

  • Proteins
  • Ribosomal Proteins