A quantitative model is developed to predict the Km of 47 human dopamine sulfotransferases by gene expression programming. Each kind of compound is represented by several calculated structural descriptors of moment of inertia A, average electrophilic reactivity index for a C atom, relative number of triple bonds, RNCG relative negative charge, HA-dependent HDSA-1, and HBCA H-bonding charged surface area. Eight fitness functions of the gene expression programming method are used to find the best nonlinear model. The best quantitative model with squared standard error and square of correlation coefficient are 0.096 and 0.91 for training data set, and 0.102 and 0.88 for test set, respectively. It is shown that the gene expression programming-predicted results with fitness function are in good agreement with experimental ones.
© 2011 John Wiley & Sons A/S.