Sequence-based analysis and prediction of lantibiotics: A machine learning approach

Comput Biol Chem. 2018 Dec:77:199-206. doi: 10.1016/j.compbiolchem.2018.10.004. Epub 2018 Oct 9.

Abstract

Lantibiotics, an important group of ribosomally synthesized peptides, represent an important arsenal of novel promising antimicrobials showing high potency in fighting against the prevalence of antibiotic resistance among microbial pathogens. However, due to the lack of high throughput strategies for the isolation and identification of these compounds, our information regarding their structure and especially sequence-based properties is far from complete. Therefore, in the present study, a comprehensive sequence-based analysis of these peptides was performed with the help of machine learning approach together with a feature selection technique. Meanwhile, an attempt to develop an accurate computational model for prediction of lantibiotics was made via constructing two datasets of 280 and 190 lantibiotic and non-lantibiotic antimicrobial peptide sequences, respectively. Based on the conducted approach and as a result of our search for a subset of relevant features of lantibiotics, particular types of sequenced-based features were observed to be preferred in lantibiotics, the knowledge-based implementation of which can be used as strategies for lantibiotic bioengineering purposes. Moreover, a SMO-based classifier was developed for the prediction of lantibiotics with the accuracy and specificity values of 88.5% and 94%, respectively which shows the great potential of the developed algorithm for the prediction of lantibiotcs. Conclusively, the accurate predictor algorithm as well as the identified sequence-based distinctiveness properties of lantibiotics can give valuable information in both the fields of lantibiotic discovery and bioengineering.

Keywords: Antimicrobial peptides; Feature selection; Lanthipeptides; Peptide design; Support vector machine.

MeSH terms

  • Algorithms
  • Amino Acid Sequence
  • Anti-Bacterial Agents / chemistry*
  • Bacteriocins / chemistry*
  • Computer-Aided Design
  • Drug Design
  • Glutamic Acid / chemistry
  • Leucine / chemistry
  • Machine Learning*

Substances

  • Anti-Bacterial Agents
  • Bacteriocins
  • Glutamic Acid
  • Leucine