Nonlinear methods in the analysis of protein sequences: a case study in rubredoxins

Biophys J. 2000 Jan;78(1):136-49. doi: 10.1016/S0006-3495(00)76580-5.

Abstract

Two computational methods widely used in time series analysis were applied to protein sequences, and their ability to derive structural information not directly accessible through classical sequence comparisons methods was assessed. The primary structures of 19 rubredoxins of both mesophilic and thermophilic bacteria, coded with hydrophobicity values of amino acid residues, were considered as time series and were analyzed by 1) recurrence quantification analysis and 2) spectral analysis of the sequence major eigenfunctions. The results of the two methods agreed to a large extent and generated a classification consistent with known 3D structural characteristics of the studied proteins. This classification separated in a clearcut manner a thermophilic protein from mesophilic proteins. The classification of primary structures given by the two dynamical methods was demonstrated to be basically different from classification stemming from classical sequence homology metrics. Moreover, on a more detailed scale, the method was able to discriminate between thermophilic and mesophilic proteins from a set of chimeric sequences generated from the mixing of a mesophilic (Rubr Clopa) and a thermophilic (Rubr Pyrfu) protein. Overall, our results point to a new way of looking at protein sequence comparisons.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Bacterial Proteins / chemistry
  • Models, Molecular
  • Molecular Sequence Data
  • Protein Structure, Tertiary
  • Rubredoxins / chemistry*
  • Sequence Alignment
  • Sequence Analysis, Protein / methods*
  • Sequence Homology, Amino Acid
  • Software

Substances

  • Bacterial Proteins
  • Rubredoxins