BioCAT: Search for biosynthetic gene clusters producing nonribosomal peptides with known structure

Comput Struct Biotechnol J. 2022 Mar 4:20:1218-1226. doi: 10.1016/j.csbj.2022.02.013. eCollection 2022.

Abstract

Nonribosomal peptides are a class of secondary metabolites synthesized by multimodular enzymes named nonribosomal peptide synthetases and mainly produced by bacteria and fungi. NMR, LC-MS/MS and other analytical methods allow to determine a peptide structure precisely, but it is often not a trivial task to find natural producers of them. There are cases when potential producers should be found among hundreds of strains, for instance, when analyzing metagenomic data. We have developed BioCAT, a tool designed for finding biosynthetic gene clusters which may produce a given nonribosomal peptide when the structure of an interesting nonribosomal peptide has already been found. BioCAT unites the antiSMASH software and the rBAN retrosynthesis tool but some improvements were added to both gene cluster and peptide structure analysis. The main feature of the method is an implementation of a position-specific score matrix to store specificities of nonribosomal peptide synthetase modules, which has increased the alignment sensitivity in comparison with more strict approaches developed earlier. We tested the method on a manually curated nonribosomal peptide producers database and compared it with competing tools GARLIC and Nerpa. Finally, we showed the method's applicability on several external examples.

Keywords: Biosynthetic gene clusters; Nonribosomal peptides; Software.