MILAMP: Multiple Instance Prediction of Amyloid Proteins

IEEE/ACM Trans Comput Biol Bioinform. 2021 May-Jun;18(3):1142-1150. doi: 10.1109/TCBB.2019.2936846. Epub 2021 Jun 3.

Abstract

Amyloid proteins are implicated in several diseases such as Parkinson's, Alzheimer's, prion diseases, etc. In order to characterize the amyloidogenicity of a given protein, it is important to locate the amyloid forming hotspot regions within the protein as well as to analyze the effects of mutations on these proteins. The biochemical and biological assays used for this purpose can be facilitated by computational means. This paper presents a machine learning method that can predict hotspot amyloidogenic regions within proteins and characterize changes in their amyloidogenicity due to point mutations. The proposed method called MILAMP (Multiple Instance Learning of AMyloid Proteins) achieves high accuracy for identification of amyloid proteins, hotspot localization, and prediction of mutation effects on amyloidogenicity by integrating heterogenous data sources and exploiting common predictive patterns across these tasks through multiple instance learning. The paper presents comprehensive benchmarking experiments to test the predictive performance of MILAMP in comparison to previously published state of the art techniques for amyloid prediction. The python code for the implementation and webserver for MILAMP is available at the URL: http://faculty.pieas.edu.pk/fayyaz/software.html#MILAMP.

MeSH terms

  • Amyloidogenic Proteins* / chemistry
  • Amyloidogenic Proteins* / genetics
  • Amyloidogenic Proteins* / metabolism
  • Computational Biology / methods*
  • Databases, Protein
  • Humans
  • Machine Learning*
  • Sequence Analysis, Protein

Substances

  • Amyloidogenic Proteins