Computational approaches for protein function prediction: a combined strategy from multiple sequence alignment to molecular docking-based virtual screening

Biochim Biophys Acta. 2010 Sep;1804(9):1695-712. doi: 10.1016/j.bbapap.2010.04.008. Epub 2010 Apr 28.

Abstract

The functional characterization of proteins represents a daily challenge for biochemical, medical and computational sciences. Although finally proved on the bench, the function of a protein can be successfully predicted by computational approaches that drive the further experimental assays. Current methods for comparative modeling allow the construction of accurate 3D models for proteins of unknown structure, provided that a crystal structure of a homologous protein is available. Binding regions can be proposed by using binding site predictors, data inferred from homologous crystal structures, and data provided from a careful interpretation of the multiple sequence alignment of the investigated protein and its homologs. Once the location of a binding site has been proposed, chemical ligands that have a high likelihood of binding can be identified by using ligand docking and structure-based virtual screening of chemical libraries. Most docking algorithms allow building a list sorted by energy of the lowest energy docking configuration for each ligand of the library. In this review the state-of-the-art of computational approaches in 3D protein comparative modeling and in the study of protein-ligand interactions is provided. Furthermore a possible combined/concerted multistep strategy for protein function prediction, based on multiple sequence alignment, comparative modeling, binding region prediction, and structure-based virtual screening of chemical libraries, is described by using suitable examples. As practical examples, Abl-kinase molecular modeling studies, HPV-E6 protein multiple sequence alignment analysis, and some other model docking-based characterization reports are briefly described to highlight the importance of computational approaches in protein function prediction.

Publication types

  • Review

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Computational Biology*
  • Humans
  • Molecular Sequence Data
  • Protein Conformation
  • Proteins / chemistry*
  • Proteins / metabolism*
  • Sequence Homology, Amino Acid

Substances

  • Proteins