Distribution and phasing of sequence motifs that facilitate CRISPR adaptation

Curr Biol. 2021 Aug 23;31(16):3515-3524.e6. doi: 10.1016/j.cub.2021.05.068. Epub 2021 Jun 25.

Abstract

CRISPR-associated proteins (Cas1 and Cas2) integrate foreign DNA at the "leader" end of CRISPR loci. Several CRISPR leader sequences are reported to contain a binding site for a DNA-bending protein called integration host factor (IHF). IHF-induced DNA bending kinks the leader of type I-E CRISPRs, recruiting an upstream sequence motif that helps dock Cas1-2 onto the first repeat of the CRISPR locus. To determine the prevalence of IHF-directed CRISPR adaptation, we analyzed 15,274 bacterial and archaeal CRISPR leaders. These experiments reveal multiple IHF binding sites and diverse upstream sequence motifs in a subset of the I-C, I-E, I-F, and II-C CRISPR leaders. We identify subtype-specific motifs and show that the phase of these motifs is critical for CRISPR adaptation. Collectively, this work clarifies the prevalence and mechanism(s) of IHF-dependent CRISPR adaptation and suggests that leader sequences and adaptation proteins may coevolve under the selective pressures of foreign genetic elements like plasmids or phages.

Keywords: CRISPR; CRISPR adaptation; Cas1-2; Cas1-2/3; DNA integration; DNA looping; DNA phase; IHF.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • CRISPR-Associated Proteins* / genetics
  • CRISPR-Associated Proteins* / metabolism
  • CRISPR-Cas Systems
  • Clustered Regularly Interspaced Short Palindromic Repeats*
  • DNA
  • Endonucleases / genetics
  • Endonucleases / metabolism

Substances

  • CRISPR-Associated Proteins
  • DNA
  • Endonucleases