Similarity/dissimilarity studies of protein sequences based on a new 2D graphical representation

J Comput Chem. 2010 Apr 15;31(5):1045-52. doi: 10.1002/jcc.21391.

Abstract

A (two-dimensional) 2D graphical representation of protein sequences based on six physicochemical properties of amino acids is outlined. The numerical characterization of protein graphs is given as descriptors of protein sequences. It is not only useful for comparative study of proteins but also for encoding innate information about the structure of proteins. The coefficient of determination is proposed as a new similarity/dissimilarity measure. Finally, a simple example is taken to highlight the behavior of the new similarity/dissimilarity measure on protein sequences taken from the ND6 (NADH dehydrogenase subunit 6) proteins for eight different species. The results demonstrate the approach is convenient, fast, and efficient.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Amino Acids / chemistry*
  • Animals
  • Computer Graphics
  • Humans
  • Molecular Sequence Data
  • NADH Dehydrogenase / chemistry*
  • Protein Subunits / chemistry
  • Sequence Alignment
  • Sequence Analysis, Protein*

Substances

  • Amino Acids
  • Protein Subunits
  • NADH Dehydrogenase