Sub-similarity matching based on data mining with dihedral angles

Int J Comput Biol Drug Des. 2013;6(1-2):131-45. doi: 10.1504/IJCBDD.2013.052207. Epub 2013 Feb 21.

Abstract

Protein sub-similarity matching remains largely unknown even though it is becoming one of the most important open problems in bioinformatics for drug and vaccine design. Variations in human immune responses to vaccines are, and thus responses, fail. We propose a new matching and protein alignment method based on clustering and Longest Common Subsequence (LCS) techniques. After clustering, we found LCS between a candidate protein and meningitis outer membrane antigen for each candidate. Each similarity was scored, and closest similarities were determined with statistical methods. We located three closely matching proteins among a total of 50 human immune system proteins. Moreover, we selected a HIV-1 related protein from one of scenarios, because it revealed a relationship between HIV and meningitis patients. We also found that Ω main chain torsion angle for atoms CA, C and N is the best angle for determining sub-similarities between meningitis antigen and immune proteins.

MeSH terms

  • Algorithms
  • Cluster Analysis
  • Computational Biology / methods*
  • Data Mining / methods*
  • Databases, Protein
  • Drug Design
  • Humans
  • Models, Molecular
  • Proteins / chemistry
  • Reproducibility of Results
  • Sequence Alignment / methods*
  • Sequence Analysis, Protein / methods*
  • Sequence Homology, Amino Acid
  • Support Vector Machine

Substances

  • Proteins