High-Throughput Analysis of DNA Break-Induced Chromosome Rearrangements by Amplicon Sequencing

Methods Enzymol. 2018:601:111-144. doi: 10.1016/bs.mie.2017.11.028. Epub 2018 Feb 21.

Abstract

The mechanistic understanding of how DNA double-strand breaks (DSB) are repaired is rapidly advancing in part due to the advent of inducible site-specific break model systems as well as the employment of next-generation sequencing (NGS) technologies to sequence repair junctions at high depth. Unfortunately, the sheer volume of data produced by these methods makes it difficult to analyze the structure of repair junctions manually or with other general-purpose software. Here, we describe methods to produce amplicon libraries of DSB repair junctions for sequencing, to map the sequencing reads, and then to use a robust, custom python script, Hi-FiBR, to analyze the sequence structure of mapped reads. The Hi-FiBR analysis processes large data sets quickly and provides information such as number and type of repair events, size of deletion, size of insertion and inserted sequence, microhomology usage, and whether mismatches are due to sequencing error or biological effect. The analysis also corrects for common alignment errors generated by sequencing read mapping tools, allowing high-throughput analysis of DSB break repair fidelity to be accurately conducted regardless of which suite of NGS analysis software is available.

Keywords: Alternative end joining; Amplicon; DNA double-strand break; Hi-FiBR; High-throughput sequencing; Homologous recombination; Microhomology; Nonhomologous end joining; Read alignment; Rearrangement; Repair junction.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • DNA / metabolism
  • DNA Breaks, Double-Stranded*
  • DNA End-Joining Repair*
  • Genetic Techniques
  • High-Throughput Nucleotide Sequencing / methods*
  • Humans
  • Recombinational DNA Repair*
  • Sequence Analysis, DNA / methods*

Substances

  • DNA