Bioinformatics Methods in Predicting Amyloid Propensity of Peptides and Proteins

Methods Mol Biol. 2022:2340:1-15. doi: 10.1007/978-1-0716-1546-1_1.

Abstract

Several computational methods have been developed to predict amyloid propensity of a protein or peptide. These bioinformatics tools are time- and cost-saving alternatives to expensive and laborious experimental methods which are used to confirm self-aggregation of a protein. Computational approaches not only allow preselection of reliable candidates for amyloids but, most importantly, are capable of a thorough and informative analysis of a protein, indicating the sequence determinants of protein aggregation, identifying the potential causal mutations and likely mechanisms. Bioinformatics modeling applies several different approaches, which most typically include physicochemical or structure-based modeling, machine learning, or statistics based modeling. Bioinformatics methods typically use the amino acid sequence of a protein as an input, some also include additional information, for example, an available structure. This chapter describes the methods currently used to computationally predict amyloid propensity of a protein or peptide. Since the accuracy of bioinformatics methods may be highly dependent on reference data used to develop and evaluate the predictors, we also briefly present the main databases of amyloids used by the authors of bioinformatics tools.

Keywords: Amyloid; Classification; Computational prediction; Machine learning; Misfolding.

MeSH terms

  • Amino Acid Sequence
  • Amyloid*
  • Computational Biology*
  • Peptides
  • Protein Aggregates

Substances

  • Amyloid
  • Peptides
  • Protein Aggregates