Illuminating the origins of spectral properties of green fluorescent proteins via proteochemometric and molecular modeling

J Comput Chem. 2014 Oct 15;35(27):1951-66. doi: 10.1002/jcc.23708. Epub 2014 Aug 12.

Abstract

Green fluorescent protein (GFP) has immense utility in biomedical imaging owing to its autofluorescent nature. In efforts to broaden the spectral diversity of GFP, there have been several reports of engineered mutants via rational design and random mutagenesis. Understanding the origins of spectral properties of GFP could be achieved by means of investigating its structure-activity relationship. The first quantitative structure-property relationship study for modeling the spectral properties, particularly the excitation and emission maximas, of GFP was previously proposed by us some years ago in which quantum chemical descriptors were used for model development. However, such simplified model does not consider possible effects that neighboring amino acids have on the conjugated π-system of GFP chromophore. This study describes the development of a unified proteochemometric model in which the GFP chromophore and amino acids in its vicinity are both considered in the same model. The predictive performance of the model was verified by internal and external validation as well as Y-scrambling. Our strategy provides a general solution for elucidating the contribution that specific ligand and protein descriptors have on the investigated spectral property, which may be useful in engineering novel GFP variants with desired characteristics.

Keywords: chromophore; data mining; green fluorescent protein; molecular modeling; proteochemometrics.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acids / analysis
  • Amino Acids / chemistry*
  • Computational Biology
  • Green Fluorescent Proteins / chemistry*
  • Green Fluorescent Proteins / genetics
  • Ligands
  • Models, Molecular*
  • Protein Engineering
  • Structure-Activity Relationship

Substances

  • Amino Acids
  • Ligands
  • Green Fluorescent Proteins