High-Density Guide RNA Tiling and Machine Learning for Designing CRISPR Interference in Synechococcus sp. PCC 7002

ACS Synth Biol. 2023 Apr 21;12(4):1175-1186. doi: 10.1021/acssynbio.2c00653. Epub 2023 Mar 9.

Abstract

While CRISPRi was previously established in Synechococcus sp. PCC 7002 (hereafter 7002), the design principles for guide RNA (gRNA) effectiveness remain largely unknown. Here, 76 strains of 7002 were constructed with gRNAs targeting three reporter systems to evaluate features that impact gRNA efficiency. Correlation analysis of the data revealed that important features of gRNA design include the position relative to the start codon, GC content, protospacer adjacent motif (PAM) site, minimum free energy, and targeted DNA strand. Unexpectedly, some gRNAs targeting upstream of the promoter region showed small but significant increases in reporter expression, and gRNAs targeting the terminator region showed greater repression than gRNAs targeting the 3' end of the coding sequence. Machine learning algorithms enabled prediction of gRNA effectiveness, with Random Forest having the best performance across all training sets. This study demonstrates that high-density gRNA data and machine learning can improve gRNA design for tuning gene expression in 7002.

Keywords: CRISPRi; Synechococcus; Synechococcus sp. PCC 7002; cyanobacteria; gRNA design; machine learning.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • CRISPR-Cas Systems* / genetics
  • Clustered Regularly Interspaced Short Palindromic Repeats / genetics
  • DNA
  • RNA
  • Synechococcus* / genetics

Substances

  • DNA
  • RNA