Use of transcriptional sequencing in difficult to read areas of the genome

Anal Biochem. 2003 May 15;316(2):202-7. doi: 10.1016/s0003-2697(03)00054-x.

Abstract

In genome and cDNA sequencing projects, current cycle sequencing often encounters difficult-to-sequence templates which have unique secondary structures due to GC-rich composition or repeated regions. Due to the formation of stable secondary structures, remarkable decreases in fluorescent signals are observed in cycle sequencing reactions. It is not easy to determine the nucleotide sequences of these regions. Although several modifications of sequencing reactions have been tried to overcome these problems, some unreadable regions remain as gaps in genome sequencing projects. Here, we further developed transcriptional sequencing technology and evaluated the sequencing accuracy in these regions. The method was successively applied to artificial GC cluster templates and putative secondary structure-forming templates from genomic and cDNA clones. Our results indicate that transcriptional sequencing is a powerful and accurate method for GC-rich regions, simple sequence repeats, hairpins (inverted repeats), tandem repeat DNA templates, and gap-closing in draft sequencing data.

MeSH terms

  • Animals
  • Base Composition
  • Base Sequence
  • DNA / chemistry
  • DNA / genetics
  • DNA, Complementary / chemistry
  • DNA, Complementary / genetics
  • DNA-Directed DNA Polymerase / chemistry
  • DNA-Directed DNA Polymerase / genetics
  • Fluorescent Dyes
  • Genome*
  • Mice
  • Molecular Sequence Data
  • Oligonucleotides / chemical synthesis
  • Oligonucleotides / genetics*
  • Reproducibility of Results
  • Sequence Alignment
  • Sequence Analysis, DNA / methods*
  • Templates, Genetic
  • Transcription, Genetic*

Substances

  • DNA, Complementary
  • Fluorescent Dyes
  • Oligonucleotides
  • DNA
  • DNA-Directed DNA Polymerase