A method to distinguish between lysine acetylation and lysine methylation from protein sequences

J Theor Biol. 2012 Oct 7:310:223-30. doi: 10.1016/j.jtbi.2012.06.030. Epub 2012 Jul 11.

Abstract

Lysine acetylation and methylation are two major post-translational modifications of lysine residues. They play vital roles in both biological and pathological processes. Specific lysine residues in H3 histone protein tails appear to be targeted for either acetylation or methylation. Hence it is very challenging to distinguish between acetylated and methylated lysine residues using computational methods. This work presents a method that incorporates protein sequence information, secondary structure and amino acid properties to differentiate acetyl-lysine from methyl-lysine. We apply an encoding scheme based on grouped weight and position weight amino acid composition to extract sequence information and physicochemical properties around lysine sites. The proposed method achieves an accuracy of 93.3% using a jackknife test. Feature analysis demonstrates that the prediction model with multiple features can take full advantage of the supplementary information from different features to improve classification performance and prediction robustness. Analysis of the characteristics of lysine residues which can be either methylated or acetylated shows that they are more similar to methyl-lysine than to acetyl-lysine.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Acetylation
  • Amino Acid Sequence
  • Amino Acids / metabolism
  • Databases, Protein
  • Lysine / metabolism*
  • Methylation
  • Molecular Sequence Data
  • Position-Specific Scoring Matrices
  • Protein Structure, Secondary
  • Proteins / chemistry*
  • Sequence Analysis, Protein / methods*
  • Support Vector Machine

Substances

  • Amino Acids
  • Proteins
  • Lysine