Structural coordinates: A novel approach to predict protein backbone conformation

PLoS One. 2021 May 20;16(5):e0239793. doi: 10.1371/journal.pone.0239793. eCollection 2021.

Abstract

Motivation: Local protein structure is usually described via classifying each peptide to a unique class from a set of pre-defined structures. These classifications may differ in the number of structural classes, the length of peptides, or class attribution criteria. Most methods that predict the local structure of a protein from its sequence first rely on some classification and only then proceed to the 3D conformation assessment. However, most classification methods rely on homologous proteins' existence, unavoidably lose information by attributing a peptide to a single class or suffer from a suboptimal choice of the representative classes.

Results: To alleviate the above challenges, we propose a method that constructs a peptide's structural representation from the sequence, reflecting its similarity to several basic representative structures. For 5-mer peptides and 16 representative structures, we achieved the Q16 classification accuracy of 67.9%, which is higher than what is currently reported in the literature. Our prediction method does not utilize information about protein homologues but relies only on the amino acids' physicochemical properties and the resolved structures' statistics. We also show that the 3D coordinates of a peptide can be uniquely recovered from its structural coordinates, and show the required conditions under various geometric constraints.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Humans
  • Protein Conformation*
  • Sequence Analysis, Protein / methods*

Grants and funding

VM has been partially supported by the DFG FOR 2715. YM, AN, YK, VT and NE have been partially supported by the Russian Foundation for Basic Research grant 20-04-01085.