Target Specificity of the CRISPR-Cas9 System in Arabidopsis thaliana, Oryza sativa, and Glycine max Genomes

J Comput Biol. 2020 Oct;27(10):1544-1552. doi: 10.1089/cmb.2019.0453. Epub 2020 Apr 16.

Abstract

Clustered regularly interspaced short palindromic repeats (CRISPR), a class of immune-associated sequences in bacteria, have been developed as a powerful tool for editing eukaryotic genomes in diverse cells and organisms in recent years. The CRISPR-Cas9 system can recognize upstream 20 nucleotides (guide sequence) adjacent to the protospacer-adjacent motif site and trigger double-stranded DNA cleavage as well as DNA repair mechanisms, which eventually result in knockout, knockin, or site-specific mutagenesis. However, off-target effect caused by guide sequence misrecognition is the major drawback and restricts its widespread application. In this study, global analysis of specificities of all guide sequences in Arabidopsis thaliana, Oryza sativa (rice), and Glycine max (soybean) were performed. As a result, a simple pipeline and three genome-wide databases were established and shared for the scientific society. For each target site of CRISPR-Cas9, specificity score and off-target number were calculated and evaluated. The mean values of off-target numbers for A. thaliana, rice, and soybean were determined as 27.5, 57.3, and 174.7, respectively. Comparative analysis among these plants suggested that the frequency of off-target effects was correlated to genome size, chromosomal locus, gene density, and guanine-cytosine (GC) content. Our results contributed to the better understanding of CRISPR-Cas9 system in plants and would help to minimize the off-target effect during its applications in the future.

Keywords: Arabidopsis thaliana; CRISPR-Cas9; Glycine max; Oryza sativa; off-target effect; specificity.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Arabidopsis / genetics*
  • CRISPR-Cas Systems*
  • Computational Biology
  • Gene Editing / methods
  • Gene Targeting / adverse effects
  • Gene Targeting / methods
  • Genome, Plant
  • Glycine max / genetics*
  • Oryza / genetics*
  • Plant Breeding / methods
  • Plants, Genetically Modified / genetics
  • RNA, Guide, CRISPR-Cas Systems / genetics

Substances

  • RNA, Guide, CRISPR-Cas Systems