Deep sequencing to reveal new variants in pooled DNA samples

Hum Mutat. 2009 Dec;30(12):1703-12. doi: 10.1002/humu.21122.

Abstract

We evaluated massive parallel sequencing and long-range PCR (LRP) for rare variant detection and allele frequency estimation in pooled DNA samples. Exons 2 to 16 of the MUTYH gene were analyzed in breast cancer patients with Illumina's (Solexa) technology. From a pool of 287 genomic DNA samples we generated a single LRP product, while the same LRP was performed on 88 individual samples and the resulting products then pooled. Concentrations of constituent samples were measured with fluorimetry for genomic DNA and high-resolution melting curve analysis (HR-MCA) for LRP products. Illumina sequencing results were compared to Sanger sequencing data of individual samples. Correlation between allele frequencies detected by both methods was poor in the first pool, presumably because the genomic samples amplified unequally in the LRP, due to DNA quality variability. In contrast, allele frequencies correlated well in the second pool, in which all expected alleles at a frequency of 1% and higher were reliably detected, plus the majority of singletons (0.6% allele frequency). We describe custom bioinformatics and statistics to optimize detection of rare variants and to estimate required sequencing depth. Our results provide directions for designing high-throughput analyses of candidate genes.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alleles
  • DNA / genetics*
  • Gene Frequency / genetics
  • Genetic Variation / genetics*
  • Genome, Human / genetics
  • Humans
  • Nucleic Acid Denaturation / genetics
  • Polymerase Chain Reaction
  • Sequence Analysis, DNA / methods*

Substances

  • DNA