A simple method based on Sanger sequencing and MS Word wildcard searching to identify Cas9-induced frameshift mutations

Lab Invest. 2017 Dec;97(12):1500-1507. doi: 10.1038/labinvest.2017.83. Epub 2017 Aug 21.

Abstract

Recent advances in targeted genome editing have enabled sequence-specific modifications in eukaryotic genomes. As it can be easily reprogrammed, the clustered regularly interspaced short palindromic repeat (CRISPR)-Cas9 nuclease system has been studied extensively and is now a widely used genome editing tool. Generally, Cas9 nucleases are designed to target the coding regions in exons of protein-coding genes, which are expected to cause frameshift indel mutations and interrupt protein expression. In such cases, it is often necessary to separate single clones that harbor double frameshift mutant alleles from clones that harbor the wild-type allele or an in-frame mutant allele. We developed a simple and efficient method to identify frameshift mutations in diploid genomes based on Sanger sequencing and MS Word wildcard searching (SWS). As indel mutations induced by Cas9 are varied, Sanger sequencing of PCR products from a single mutant genome will generate double peaks that begin at the indel sites. By positioning the putative sequences deduced from the double peak regions in the sequencing graph onto the wild-type sequence by MS Word wildcard searching, it is possible to predict exactly how many nucleotides were deleted or inserted in each allele of the genome. The SWS strategy greatly facilitates the process of identifying single clones with biallelic frameshift mutations from pooled cells or model organisms.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • CRISPR-Cas Systems / genetics*
  • Cell Line, Tumor
  • Computational Biology / methods*
  • Data Mining / methods*
  • Frameshift Mutation / genetics*
  • Gene Editing / methods*
  • Humans
  • Sequence Analysis, DNA / methods*