Improved prediction and characterization of blood-brain barrier penetrating peptides using estimated propensity scores of dipeptides

J Comput Aided Mol Des. 2022 Nov;36(11):781-796. doi: 10.1007/s10822-022-00476-z. Epub 2022 Oct 26.

Abstract

The blood-brain barrier (BBB) is a highly selective semipermeable border between blood vascular endothelial cells and the central nervous system. Because the BBB can prevent drugs circulating in the blood from crossing into the interstitial fluid of the brain, where neurons reside, developing drug delivery systems that penetrate the BBB remains a challenge and an active area of research. Blood-brain barrier penetrating peptides (B3PPs) are therefore an alternative neurotherapeutic for brain-related disorders, since they can facilitate drug delivery into the brain. Meanwhile, computational methods that identify and characterize B3PPs cost-effectively play an important role in basic research and in the pharmaceutical industry. Although a few computational methods for B3PP identification have been developed, their generalization ability and interpretability may be limited. In this study, a novel and efficient scoring card method-based predictor (termed SCMB3PP) is presented for improving B3PP identification and characterization. To overcome the limitation of black-box computational approaches, the SCMB3PP predictor automatically estimates the propensities of amino acids and dipeptides to be B3PPs. Both cross-validation and independent tests indicate that SCMB3PP achieves impressive performance and outperforms various popular machine learning-based methods and the existing methods on multiple independent test datasets. Furthermore, SCMB3PP-derived amino acid propensities were utilized to identify informative biophysical and biochemical properties for characterizing B3PPs. Finally, a user-friendly online web server ( http://pmlabstack.pythonanywhere.com/SCMB3PP ) has been established to identify novel and potential B3PPs cost-effectively. This novel computational approach is anticipated to facilitate the large-scale identification of high-potential B3PP candidates for follow-up experimental validation.
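
The abstract does not spell out how the propensity scores are computed, so the following is only a minimal sketch of the general idea behind a dipeptide scoring card, not the SCMB3PP implementation. It assumes initial scores come from the difference in mean dipeptide composition between known positives and negatives, rescaled to 0..1000, and that a peptide is scored as the composition-weighted sum of those scores. Any optimization of the initial scores and the choice of a decision threshold are omitted. All function names and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
DIPEPTIDES = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]  # 400 pairs
DIPEPTIDE_SET = set(DIPEPTIDES)


def dipeptide_composition(seq):
    """Fraction of each dipeptide among the overlapping pairs in seq."""
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    counts = Counter(p for p in pairs if p in DIPEPTIDE_SET)
    total = sum(counts.values())
    return {d: (counts[d] / total if total else 0.0) for d in DIPEPTIDES}


def mean_composition(seqs):
    """Average dipeptide composition over a list of sequences."""
    comps = [dipeptide_composition(s) for s in seqs]
    return {d: sum(c[d] for c in comps) / len(comps) for d in DIPEPTIDES}


def initial_propensities(positives, negatives):
    """Assumed initial scores: (positive - negative) composition, scaled to 0..1000."""
    pos, neg = mean_composition(positives), mean_composition(negatives)
    diff = {d: pos[d] - neg[d] for d in DIPEPTIDES}
    lo, hi = min(diff.values()), max(diff.values())
    span = (hi - lo) or 1.0  # avoid division by zero when all differences are equal
    return {d: 1000.0 * (diff[d] - lo) / span for d in DIPEPTIDES}


def peptide_score(seq, propensities):
    """Composition-weighted sum of dipeptide propensity scores."""
    comp = dipeptide_composition(seq)
    return sum(comp[d] * propensities[d] for d in DIPEPTIDES)


def amino_acid_propensities(propensities):
    """Average the scores of all dipeptides that contain each amino acid."""
    result = {}
    for aa in AMINO_ACIDS:
        vals = [propensities[d] for d in DIPEPTIDES if aa in d]
        result[aa] = sum(vals) / len(vals)
    return result


# Toy sequences invented for illustration only; they are not real B3PP data.
toy_positives = ["RRKK", "KRRK"]
toy_negatives = ["DDEE", "EDDE"]

scores = initial_propensities(toy_positives, toy_negatives)
aa_scores = amino_acid_propensities(scores)
print(peptide_score("RRKR", scores), peptide_score("DEDE", scores))
```

In a sketch like this, a peptide would be called a predicted B3PP when its score exceeds a threshold tuned on training data. Because each dipeptide and amino acid carries an explicit score, the ranking can be inspected directly, which is the kind of interpretability the abstract contrasts with black-box approaches.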

Keywords: Bioinformatics; Blood-brain barrier; Machine learning; Penetrating peptides; Propensity score; Scoring card method; Sequence analysis.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acids / chemistry
  • Blood-Brain Barrier*
  • Dipeptides* / chemistry
  • Dipeptides* / metabolism
  • Endothelial Cells
  • Peptides / metabolism
  • Propensity Score

Substances

  • Dipeptides
  • Peptides
  • Amino Acids