Estimating Copy-Number Proportions: The Comeback of Sanger Sequencing

Genes (Basel). 2021 Feb 17;12(2):283. doi: 10.3390/genes12020283.

Abstract

Determination of the relative copy numbers of mixed molecular species in nucleic acid samples is often the objective of biological experiments, including Single-Nucleotide Polymorphism (SNP), indel and gene copy-number characterization, and quantification of CRISPR-Cas9 base editing, cytosine methylation, and RNA editing. Standard dye-terminator chromatograms are a widely accessible, cost-effective information source from which copy-number proportions can be inferred. However, the rate of incorporation of dye terminators is dependent on the dye type, the adjacent sequence string, and the secondary structure of the sequenced strand. These variable rates complicate inferences and have driven scientists to resort to complex and costly quantification methods. Because these complex methods introduce their own biases, researchers are rethinking whether rectifying distortions in sequencing trace files and using direct sequencing for quantification will enable comparable accurate assessment. Indeed, recent developments in software tools (e.g., TIDE, ICE, EditR, BEEP and BEAT) indicate that quantification based on direct Sanger sequencing is gaining in scientific acceptance. This commentary reviews the common obstacles in quantification and the latest insights and developments relevant to estimating copy-number proportions based on direct Sanger sequencing, concluding that bidirectional sequencing and sophisticated base calling are the keys to identifying and avoiding sequence distortions.

Keywords: CNV; CpG; RNA editing; SNP; base editor; bisulfite sequencing; dye-terminator DNA sequencing; heteroplasmy; replication diode.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • CRISPR-Cas Systems / genetics
  • DNA Copy Number Variations / genetics*
  • Gene Dosage / genetics*
  • INDEL Mutation / genetics
  • Polymorphism, Single Nucleotide / genetics
  • RNA Editing / genetics
  • Sequence Analysis, DNA / trends*
  • Software