GAP-Seq: a method for identification of DNA palindromes

BMC Genomics. 2014 May 22;15(1):394. doi: 10.1186/1471-2164-15-394.

Abstract

Background: Closely spaced long inverted repeats, also known as DNA palindromes, can undergo intrastrand annealing to form DNA hairpins. The ability to form these hairpins results in genome instability, difficulties in maintaining clones in Escherichia coli and major problems for most DNA sequencing approaches. Because of their role in genomic instability and gene amplification in some human cancers, it is important to develop systematic approaches to detect and characterize DNA palindromes.

Results: We developed a new protocol to identify palindromes that couples the S1 nuclease treated Cot0 DNA (GAPF) with high-throughput sequencing (GAP-Seq). Unlike earlier protocols, it does not involve restriction enzymatic digestion prior to DNA snap-back thereby preserving longer DNA sequences. It also indicates the location of the novel junction, which can then be recovered. Using MCF-7 breast cancer cell line as the proof-of-principle analysis, we have identified 35 palindrome candidates and physically characterized the top 5 candidates and their junctions. Because this protocol eliminates many of the false positives that plague earlier techniques, we have improved palindrome identification.

Conclusions: The GAP-Seq approach underscores the importance of developing new tools for identifying and characterizing palindromes, and provides a new strategy to systematically assess palindromes in genomes. It will be useful for studying human cancers and other diseases associated with palindromes.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Computational Biology
  • DNA / genetics*
  • High-Throughput Nucleotide Sequencing / methods*
  • Humans
  • MCF-7 Cells
  • Polymerase Chain Reaction

Substances

  • DNA

Associated data

  • GEO/GSE43679
  • SRA/SRA064847
  • SRA/SRA065361