Distribution and molecular evolution of the anti-CRISPR family AcrIF7

PLoS Biol. 2023 Apr 21;21(4):e3002072. doi: 10.1371/journal.pbio.3002072. eCollection 2023 Apr.

Abstract

Anti-clustered regularly interspaced short palindromic repeats (CRISPRs) are proteins capable of blocking CRISPR-Cas systems and typically their genes are located on mobile genetic elements. Since their discovery, numerous anti-CRISPR families have been identified. However, little is known about the distribution and sequence diversity of members within a family, nor how these traits influence the anti-CRISPR's function and evolution. Here, we use AcrIF7 to explore the dissemination and molecular evolution of an anti-CRISPR family. We uncovered 5 subclusters and prevalent anti-CRISPR variants within the group. Remarkably, AcrIF7 homologs display high similarity despite their broad geographical, ecological, and temporal distribution. Although mainly associated with Pseudomonas aeruginosa, AcrIF7 was identified in distinct genetic backgrounds indicating horizontal dissemination, primarily by phages. Using mutagenesis, we recreated variation observed in databases but also extended the sequence diversity of the group. Characterisation of the variants identified residues key for the anti-CRISPR function and other contributing to its mutational tolerance. Moreover, molecular docking revealed that variants with affected function lose key interactions with its CRISPR-Cas target. Analysis of publicly available data and the generated variants suggests that the dominant AcrIF7 variant corresponds to the minimal and optimal anti-CRISPR selected in the family. Our study provides a blueprint to investigate the molecular evolution of anti-CRISPR families.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bacteriophages* / genetics
  • CRISPR-Cas Systems* / genetics
  • Evolution, Molecular
  • Humans
  • Molecular Docking Simulation
  • Mutation

Grants and funding

W.F. acknowledges funding from Cambridge Trust (10469474) and National Council of Science and Technology-CONACYT (591274 and 706017). A.C. has been supported by the EMBL-EBI/Wellcome Trust Sanger Institute Join Post-Doctoral Fellowship Program (ESPOD). D.C was supported by the PhD scholarship 586079 from National Council of Science and Technology-CONACYT. G.G. acknowledges funding from National Council of Science and Technology-CONACYT CB 255255. Funding for open access charge: University of Cambridge. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.