A pan-CRISPR analysis of mammalian cell specificity identifies ultra-compact sgRNA subsets for genome-scale experiments

Nat Commun. 2022 Feb 2;13(1):625. doi: 10.1038/s41467-022-28045-w.

Abstract

A genetic knockout can be lethal to one human cell type while increasing growth rate in another. This context specificity confounds genetic analysis and prevents reproducible genome engineering. Genome-wide CRISPR compendia across most common human cell lines offer the largest opportunity to understand the biology of cell specificity. The prevailing viewpoint, synthetic lethality, occurs when a genetic alteration creates a unique CRISPR dependency. Here, we use machine learning for an unbiased investigation of cell type specificity. Quantifying model accuracy, we find that most cell type specific phenotypes are predicted by the function of related genes of wild-type sequence, not synthetic lethal relationships. These models then identify unexpected sets of 100-300 genes where reduced CRISPR measurements can produce genome-scale loss-of-function predictions across >18,000 genes. Thus, it is possible to reduce in vitro CRISPR libraries by orders of magnitude-with some information loss-when we remove redundant genes and not redundant sgRNAs.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • CRISPR-Cas Systems*
  • Cell Line
  • Clustered Regularly Interspaced Short Palindromic Repeats*
  • Gene Knockout Techniques
  • Genetic Engineering*
  • Genomics
  • Humans
  • Machine Learning*
  • Synthetic Lethal Mutations