HUMAN FAMSD-BASE: high quality protein structure model database for the human genome using the FAMSD homology modeling method

Chem Pharm Bull (Tokyo). 2010 Jan;58(1):66-75. doi: 10.1248/cpb.58.66.

Abstract

Almost all proteins express their biological functions through the structural conformation of their specific amino acid sequences. Therefore, acquiring the three-dimensional structures of proteins is very important to elucidate the role of a particular protein. We had built protein structure model databases, which is called RIKEN FAMSBASE (http://famshelp.gsc.riken.jp/famsbase/). The RIKEN FAMSBASE is a genome-wide protein structure model database that contains a large number of protein models from many organisms. The HUMAN FAMSBASE that is one part of the RIKEN FAMSBASE contains many protein models for human genes, which are significant in the pharmaceutical and medicinal fields. We have now implemented an update of the human protein modeling database consisting of 242918 constructed models against the number of 20743 human protein sequences with an improved modeling method called Full Automatic protein Modeling System Developed (FAMSD). The results of our benchmark test of the FAMSD method indicated that it has an excellent capability to pack amino acid side-chains with correct torsion angles in addition to the main-chain, while avoiding the formation of atom-atom collisions that are not found in experimental structures. This new protein structure model database for human genes, which is named HUMAN FAMSD-BASE, is open to the public as a component part of the RIKEN FAMSBASE at http://mammalia.gsc.riken.jp/human_famsd/. A significant improvement of the HUMAN FAMSD-BASE in comparison with the preceding HUMAN FAMSBASE was verified in the benchmark test of this paper. The HUMAN FAMSD-BASE will have an important impact on the progress of biological science.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Aspartate-tRNA Ligase / chemistry
  • Databases, Protein
  • Genome, Human
  • Humans
  • Models, Molecular
  • Protein Conformation
  • Proteins / chemistry*
  • Proteins / genetics
  • Sequence Alignment / methods*
  • Software
  • Structural Homology, Protein*

Substances

  • Proteins
  • Aspartate-tRNA Ligase